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Introduction 



This book is one of the subproducts of my dissertation. If its aim had to be summarized 
in a single sentence, it could be algebraization of graph grammars. An equivalent one 
would be study of graph dynamics. 

From the point of view of a computer scientist, graph grammars are a natural gener- 
alization of Chomsky grammars for which a purely algebraic approach does not exist up 
to now. A Chomsky (or string) grammar is, roughly speaking, a precise description of 
a formal language (which in essence is a set of strings). On a more discrete mathemati- 
cal style, it can be said that graph grammars - Matrix Graph Grammars in particular - 
study dynamics of graphs. Ideally, this algebraization would enforce our understanding of 
grammars in general, providing new analysis techniques and generalizations of concepts, 
problems and results known so far. 

In this book we fully develop such theory over the field GF(2) which covers all graph 
cases, from simple graphs (more attractive for a mathematician) to multidigraphs (more 
interesting for an applied computer scientist). The theory is presented and its basic 
properties demonstrated in a first stage, moving to increasingly difficult problems and 
establishing relations among them: 

• Applicability, for which two equivalent characterizations (necessary and sufficient 
conditions) are provided. 

• Independence. Sequential and parallel independence in particular, generalizing pre- 
viously known results for two elements. 
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• Restrictions. The theory developed so far for graph constraints and application 
conditions is significantly generalized. 

• Reachability. The state equation for Petri nets and related techniques are extended 
to general Matrix Graph Grammars. Also, Matrix Graph Grammars techniques are 
applied to Petri nets. 

Throughout the book many new concepts are introduced such as compatibility, coher- 
ence, initial and negative graph sets, etcetera. Some of them project interesting insights 
about a given grammar, while others are used to study previously mentioned problems. 

Matrix Graph Grammars have several advantages. First, many branches of math- 
ematics are at our disposal. It is based on Boolean algebra, so first and second order 
logics can be applied almost directly. They admit a functional representation so many 
ideas from functional analysis can be utilized. On the more algebraic side it is possible 
to use group theory and tensor algebra. Finally, category theory constructions such as 
pushouts are available as well. Second, as it splits the static definition from the dynamics 
of the system, it is possible to study to some extent many properties of the grammar 
without the need of an initial state. Third, although it is a theoretical tool, Matrix Graph 
Grammars are quite close to implementation, being possible to develop tools based on 
this theory. 

This introductory chapter aims to provide some perspective on graph grammars in 
general and on Matrix Graph Grammars in particular. In Sec. 1.1 we present a (partial) 
historical overview of graph grammars and graph transformation systems taken from 
several sources but mainly from [36] and [22]. Section 1.2 introduces those open problems 
that have guided our research. Finally, in Sec. 1.3 we brush over the book and see how 
applicability, sequential independence and reachability articulate it. 

1.1 Historical Overview 

Research in graph grammars started in the late 60's [66] [69], strongly motivated by prac- 
tical problems in computer science and since then it has become a very active area. 
Currently there is a wide range of applications in different branches of computer science 
such as formal language theory, software engineering, pattern recognition and genera- 
tion, implementation of term rewriting, logical and functional programming, compiler 
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contruction, database design and theory, visual programming and modeling languages 
and many more (see [23] for references on these and other topics). 

There are different approaches to graph grammars and graph transformation sys- 
tems. 1 Among them, the most prominent are the algebraic, logical, relational and set- 
theoretical. 




p* 

Fig. 1.1. Main Steps in a Grammar Rule Application 

The main steps - some of which are summarized in Fig. 1.1 - in all approaches for 
the application of a grammar rule p : L — » R to a host graph G (also known as initial 
state) to eventually obtain a final state H are almost the same: 

1. Select the grammar rule to be applied (p : L — > R in this case). In general this step 
is non-deterministic. 

2. Find an occurrence of L in G. In general this step is also non-deterministic because 
there may be several occurrences of L in G. 

3. Check any application condition of the production. 

4. Remove elements that appear in L but not in R. There are two possibilities for 
so-called dangling edges: 2 

a) Production is not applied. 

b) Dangling edges arc deleted too. 

1 The only difference between a grammar and a transformation system is that a grammar 
considers an initial state while a transformation system does not. 

2 Edges not appearing in the rule specification that are incident to one node to be eliminated. 
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If the production is to be applied, the system state changes from G to G\ (see Fig. 
1.1). 

5. Glue R with G\. The system state changes from G\ to H (see Fig. 1.1). 

Now wc shall briefly review previously mentioned families of approaches. The so-called 
algebraic approach to graph grammars (graph transformation systems) is characterized 
by relying almost exclusively on category theory and using gluing of graphs to perform 
operations. It can be divided into at least three main sub approaches, depending on the 
categorical construction under use: DPO (Double PushOut, see Sec. 3.1), SPO (Single 
PushOut, see Sec. 3.2), pullback and double pullback (also summarized in Sec. 3.2). We 
will not comment on others, like sesquipushout for example (see [9]). 

DPO was initiated by Ehrig, Pfender and Schneider in the early 70's [21] as a gener- 
alization of Chomsky grammars in order to consider graphs instead of strings. It seems 
that the term algebraic was appended because graphs might be considered as a special 
kind of algebras and because the pushout construction was perceived more as a concept 
from universal algebra than from category theory. Nowadays it is the more prominent 
approach to graph rewriting, with a vast body of theoretical results and several tools for 
their implementation. 3 

By mid and late 80's Raoult [67], Kennaway [41] [42] and L6wc [48] developed SPO 
approach probably motivated by some "restrictions" of DPO, e.g. the usage of total 
instead of partial morphisms. Raoult and Kennaway were focused on term graph rewriting 
while Lowe took a more general approach. 

In the late 90 's a new approach - although less prominent for now - emerged by 
reverting all arrows (using pullbacks instead of pushouts), proposed by Bauderon [5]. It 
seems that, in contrast to the pushout construction, pullbacks can handle deletion and 
duplication more easily. 

DPO has been generalized recently through adhesive HLR categories, which is sum- 
marized in Sec. 3.2 (we are not awared of a similar initiative for SPO or pullback). For 
a detailed account see [22]. Instead of just considering graphs, all main ideas in DPO 
can be extended to higher level structures like labeled graphs, typed graphs, Petri nets, 

3 For example AGG - see [73] or visit http://tfs.cs.tu-berlin.de/agg/ - and AToM 3 - see 
[44] or visit http://atom3.cs.mcgill.ca/ -. 
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etcetera. This is firstly accomplished in [16] and [17], starting the theory of HLR systems 
(high level replacement systems). Independently, Lack and Sobocihski in [43] introduced 
the concept of adhesive category and in [18] both were merged to get adhesive HLR 
categories. 

In this book we shall refer to these approaches as categorical, to distinguish from ours 
which is more algebraic in nature. 

The so-called set-theoretic approach (sometimes also known as algorithmic approach) 
substitutes one structure by another structure, either nodes or edges. There are two sub- 
families, node replacement and edge replacement (also hyperedge replacement), depending 
on the type of elements to be replaced. Node replacement (edNCE) was introduced in 
[54] [55] and further investigated in many papers. It is based on connecting instead of 
gluing for embedding one graph into another. Many extensions and particular cases have 
been studied so far, and many others, such as C-edNCE when considering confluence, 
NCE, NLC, dNLC, edNLC and edNCE (see Sec. 3.3 for the meaning of acronyms) are 
currently on going. Hyperedge replacement was introduced in the early 70s by Feder [27] 
and Pavlidis [58] and has been intensively investigated since then. Contrary to the node 
replacement approach, it is based on gluing. Please, see Sees. 3.3 and 3.4 for a quick 
introduction. 

It is possible to use logics to express graphs and to encode graph transformation. In 
Sec. 3.5 this kind of approach with monadic second order logic is reviewed presenting its 
foundations and main results. 4 . 

The relational approach (or algebraic-relational approach) is based on relational meth- 
ods to specifying graph rewriting (in fact it could be applied to more general structures 
than graphs). Once a graph is characterized as a relational structure it is possible to 
apply all relational machinery, substituting categories by allegories and Dedekind cate- 
gories. Probably, the main advantage is that it is possible to give local characterization 
of concepts. The roots of this approach seem to date back to the early 1970's with the 
papers of Kawahara [38] [39] [40] establishing a relational calculus inside topos theory. An 
overview can be found in Sec. 3.6. 

Our approach has been influenced by these approaches to a different extent, heavily 
depending on the topic. The basics of Matrix Graph Grammars are most influenced by 

4 Monadic Second Order Logics, MSOL, lie in between first and second order logics 
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the categorical approach, mainly by SPO in the shape of productions and to some extent 
of direct derivations. For application conditions and graph constraints, our inspiration 
comes almost exclusively from MSOL. Concerning the relational approach, our basic 
structure has a natural representation in relational terms but the development in both 
cases is very different. The influence of hyperedge replacement and node replacement, if 
any, is much more fuzzy. 

1.2 Motivation 

The dissertation that gave rise to this book started as a project to study simulation proto- 
cols (conservative, optimistic, etcetera) under graph transformation systems. In the first 
few weeks we missed a real algebraic approach to graph grammars. "Real" in the sense 
that there are algebraic representations of graphs very close to basic algebraic structures 
such as vector spaces (incidence or adjacency matrices for example) but the theories 
available so far do not make use of them. As commented above, the main objective of 
this book is to give an algebraization of graph grammars. 

One advantage foreseen from the very beginning was the fact that nice interpretations 
in terms of functional analysis and physics could be used to move forward, despite the 
fact that the underlying structure is binary so, if necessary, it was possible to bring in 
easily logics and its powerful methods. 

Our schedule included several increasingly difficult problems to be treated by our 
approach with the hope of getting better insight and understanding, trying to generalize 
whenever possible and, most importantly, providing a unified body of results in which 
all concepts and ideas would fit naturally. 

First things first, so we begin with the name of the book: Matrix Graph Grammars. 
It has been chosen to emphasize the algebraic part of the approach - although there 
are also logics, tensors, operators - and to recall matrix mechanics as introducced by 
Born, Heisenberg and Jordan in the first half of the twentieth century. 5 You are kindly 
invited to visit http://www.mat2gra.info for further research, a webpage dedicated to 
this topic that I (hopefully) intend to maintain. 

5 An alternative was YAGGA, which stands for Yet Another Graph Grammar Approach (in 
the style of the famous "Yet Another..." series). 
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Section 1.1 points out that motivations of some graph grammar approaches have 
been quite close to practice, in contrast with Matrix Graph Grammars (MGG) which is 
more theoretically driven. Nonetheless, there is an on-going project to implement a graph 
grammar tool based on AToM 3 (see [44] or visit http://atom3.cs.mcgill.ca/) using 
algorithms derived from this book (the analysis algortihms are expected to have a good 
performance). We will briefly touch on this topic in Sec. 5.3. Appendix A illustrates all 
the theory with a more or less realistic case study. 

This "basis for theoretical studies" intends to provide us with the capability of solving 
theoretical problems as those commented below, which are the backbone of the book. 

Informally, a grammar is a set of productions plus an initial graph which we can think 
of as a collection of functions plus an initial set. A sequence of productions would then 
be a sequence of functions, applied in order. Together with the function we specify the 
elements that must be found in the initial set (in its domain), so in order to apply a 
function we must first find the domain of the function in the initial set (this process is 
known as matching) . As productions are applied, the system moves on transforming the 
initial set in a sequence of intermediate sets to eventually arrive to a final state (final 
set). 6 Actually, we will deal neither with sets nor with functions but with directed graphs 
and morphisms. 

So far we will speak of graphs, digraphs or simple digraphs meaning in all cases simple 
digraphs. See Sec. 2.3 for its definition and main properties. 

Once grammar rules have been defined and its main properties established, the first 
problem we will address is the characterization of applicability, i.e. give necessary and 
sufficient conditions to guarantee that a sequence can be applied to an initial state (also 
known as host graph) to output a final state (a graph again) . Formally stated for further 
reference: 

Problem 1 (Applicability) For a sequence s n made up of rules in grammar (5 and a 
simple 7 digraph G, is it possible to apply s n to the host graph G ?. 

6 The natural interpretation is that functions modify sets, so some dynamics arise. 

7 Defined in Sec. 2.3. 
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No restriction is set on the output of the sequence except that it is a simple digraph. 
There is a basic problem when deleting nodes known as dangling condition: Arc all 
incident edges eliminated too?. Otherwise the output would not a be digraph. 

When we have a production and a matching (for that production) we will speak of a 
direct derivation. A sequence of direct derivations is called a derivation. 

A quite natural progression in the study of grammars is the following question, that 
we call independence problem. 8 

Problem 2 (Independence) For two given derivations d n and d' n applicable to host 
graph G, do they reach the same state?, i.e. is d n (G) = d' n (G)?. 

Mind the similarities with confluence and local confluence (see below). However, in- 
dependence is a very general problem and we will be interested in a reduced version of 
it, known as sequential independence, which is widely addressed in the graph grammar 
literature and also in other branches of computer science. As far as we know, in the 
literature [22; 23] this problem is addressed for sequences of two direct derivations, being 
longer sequences studied pairwise. 

Problem 3 (Sequential Independence) For two derivations d n and d' n = a(d n ) ap- 
plicable to host graph G, with a a permutation, do they reach the same state 1 ?. 

Of course, problems 2 and 3 can be extended easily to consider any finite number of 
derivations and, in both cases, there is a dependence relationship with respect to problem 
1. 

Our next step will be to generalize some theory from Petri nets [53] , which can be seen 
as a particular case of Matrix Graph Grammars. In particular, our interest is focused on 
reachability: 

Problem 4 (Reachability) For two given states (initial So and final St), is there any 
sequence made up of productions in G that transforms So into St ?■ 

In the theory developed so far for Petri nets, reachability is addressed using the 
state equation (linear system) which is a necessary condition for the existence of such a 
sequence (see Chap. 8). 

8 Independence from the point of view of the grammar. It does not matter which path the 
grammar follows because in both cases it finishes in the same state. 
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Problem 4 directly relies on problem 1. More interestingly, it is also related to prob- 
lems 2 and 3: As every solution provided by the state equation specifies the set of pro- 
ductions to be applied but not the order (see Sec. 8.1), sequences associated to different 
solutions of the state equation can be independent but not sequential independent (this 
is because different sets of solutions apply each production a different number of times). 
So, in particular, reachability can be useful to split independence and sequential inde- 
pendence. 



Sequential 
Independence 



Independence 



Applicability 



1 ► Reachability 

Fig. 1.2. Partial Diagram of Problem Dependencies 



All these problems with their corresponding dependencies are summarized in Fig. 1.2. 
Compare with the complete diagram that includes mid-term and long-term research in 
Fig. 9.1 on p. 234. 

Although we will not study confluence in this book (except some ideas in Chap. 9), 
just to make a complete account two further related problems are introduced. We will 
briefly review them in the last chapter. 

Problem 5 (Confluence) For two given states Si and S2, do there exist two deriva- 
tions d\ and e?2 such that d\{S\) = (^(S^) — 

Strictly speaking this is not confluence as defined in the literature [74]. On the left 
of Fig. 1.3 you can find confluence: For the initial state So that independently evolves 
to S\ and S2, is it possible to find derivations that close the diamond?. 9 On the right of 

9 The difference between local confluence and confluence is that in the former to move from So 
to Si or S2 it is mandatory to use a direct derivation and not a derivation. 
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the same figure we have represented problem 5. The difference is that a common initial 
state is not assumed. 




s. 



V 



Fig. 1.3. Confluence 



In mathematics, existence and uniqueness theorems are central to any of its branches. 
As it is, the analogous terms in computer science are termination and confluence, respec- 
tively. 

In some sense we may think of reachability as opening or broadening the state space 
of a given grammar while confluence, as introduced here, closes or bounds it. 

Problem 5 deals with confluency of confluence. The other part (how to actually get to 
the states S\ and S2) is more related to reachability. Note that if one of the derivations 
is the identity then problem 5 becomes problem 4 (reachability). 

If we limit to permutation of sequences, as in the derivation of problem 3 out of 
problem 2, we can pose: 

Problem 6 (Sequential Confluence) For two given initial states, do there exist two 
derivations (one permutation of the other) with isomorphic final states?. 

Again, it is not difficult to make them consider any finite set of derivations instead of 
just two. Once we know if a grammar is confluent, the next step is to know how much it 
takes to get to its final state. This is very close to complexity. Complexity theory is not 
addressed in this book. 

To the best of our knowledge, applicability (problem 1) has not been addressed up to 
now. Independence and sequential independence (problems 2 and 3) are very popular. 10 

10 Actually, it is sequential independence the one normally addressed in the literature. We have 
introduced independence for its potential link with confluence. 
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See for example Chaps. 3 and 4 in [23]. Reachability is a key concept and has been 
studied and partially characterized in many papers, mainly in Petri nets theory. See 
[53]. Confluence is a concept of fundamental importance to grammar theory. For term 
rewriting systems see [30]. 

1.3 Book Outline 

Based on the problems commented in previous section, the book is organized in nine 
chapters plus one appendix. The First three chapters, including this one, are introductory. 
Chapter 2 provides a short overview of needed mathematical machinery which includes 
some basic results from logics (first and monadic second order), category theory, tensor 
algebra, graph theory, functional analysis (notation and some basic results) and group 
theory. We have not used advanced results on any of these disciplines so probably a quick 
review should suffice, mainly for fixing notation. 

Graph grammars approaches are discussed in Chap. 3, which essentially expands 
the overview in Sec. 1.1. Sections 3.1 and 3.2 cover algebraic approaches, for which we 
prefer the term categorical. Set-theoretic approaches (node and hyperedge replacement) 
are covered in Sees. 3.3 and 3.4. Term rewriting through monadic second order logics is 
the MSOL approach, to which Sec. 3.5 is devoted. The chapter ends with the relational 
approach in Sec. 3.6. The objective of this chapter is to get an idea of each approach 
(and not to provide a detailed study) in order to, among other things, ease comparison 
with Matrix Graph Grammars. 

Chapter 4 introduces the basics of our proposal (Sec. 4.1) and prepares to attack prob- 
lem 1 by introducing concepts such as completion (Sec. 4.2), coherence and sequences 
(Sec. 4.3), nihilation matrix and minimal and negative initial digraphs (in Sec. 4.4, sub- 
sequently generalized to initial digraph set in Sec. 5.3), composition and compatibility 
(Sec. 4.5) and theorems related to their properties and characterizations. 

Chapter 5 covers an essential part of production applicability: Matching the left hand 
side (LHS) of a production inside the host graph. Dangling edges are covered, dealing 
with them with what we call e-productions in Sec. 5.1 and further studied and classified 
in Sec. 5.4. We deal with marking in Sec. 5.2, which can help in case it is necessary to 
guarantee that several productions have to be applied in the same place. Minimal and 
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negative initial digraphs are generalized to the initial digraph set in Sec. 5.3. In Sec. 5.5 
we give two characterizations for applicability (problem 1). 

We will cope with sequential independence (problem 3) for quite general families of 
permutations in Chap. 6. Sameness of minimal initial digraph (called G-congruence) for 
two sequences is addressed in Sec. 6.2; the case of two derivations is seen in Sec. 6.3. 
Explicit parallelism is studied in Sec. 6.4 through composition and G-congruence, which 
is related to initial digraph sets. 

In Chap. 7 graph constraints and application conditions (preconditions and postcon- 
ditions) are studied for Matrix Graph Grammars. They are introduced in Sec. 7.1 where 
a short overview of related concepts in other graph grammars approaches is carried out. 
The notion of direct derivation is extended to cope with application conditions in Matrix 
Graph Grammars in a very natural manner in Sec. 7.2 and functionally represented in 
Sec. 7.3. In Sec. 7.4 we show how it is possible to transform postconditions into precondi- 
tions and viceversa. Both of theoretical and of practical importance is the use of variable 
nodes because, among other things, it allows us to automatically extend the theory to 
include multidigraphs without any change of the theory of Matrix Graph Grammar in 
Sec. 7.5. 

In Chap. 8 problem 4 (reachability) is tackled, extending results from Petri nets to 
more general grammars. Section 8.1 quickly introduce this theory and summarizes some 
basic results. Section 8.2 applies some results from previous chapters to Petri nets. The 
rest of the chapter is devoted to extending Petri nets results for reachability to Matrix 
Graph Grammars, in particular Sec. 8.3 covers graph grammars without dangling edges 
while Sec. 8.4 deals with the general case. 

The book ends in Chap. 9 with the conclusions and further research. A summary of 
what we think are our most important contributions can be found there. 

Finally, in appendix A a fully worked case study is presented in which all main 
theorems are applied together with detailed explanations and implementation remarks 
and advices. 

Most of the material presented in this book has been published [59], [60], [61], [62], 
[63] and [64] and presented in international congresses: ICM'2006 (International Congress 
of Mathematicians, awarded with the second prize of the poster competition in Sec- 
tion 15, Mathematical Aspects of Computer Science), ICGT'2006 (International Confer- 
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ence on Graph Transformations), PNGT'2006 (Petri Nets and Graph Transformations), 
PROLE'2007 (VII Jornadas sobre Programacion y Lenguajes) and GT-VC'2007 (Graph 
Transformation for Verification and Concurrency, in CONCUR'2007). 



2 



Background and Theory 



The Matrix Graph Grammar approach uses many mathematical theories which might 
seem distant one from the others. Nevertheless, there are some interesting ideas connect- 
ing them which we seize to contribute whenever possible. Matrix Graph Grammars do 
not depend on any novel theorem that opens a new field of research, but aims to put 
"old" problems in a new perspective. 

There are excellent books available covering every subject of this topic. There are 
also excellent resources on the web. We think that this fast introduction should suffice. 
It is intended as a reference chapter. All concepts are highlighted in bold to ease their 
location. 

2.1 Logics 

Logics are of fundamental importance to Matrix Graph Grammars for two reasons. First, 
graphs are represented by their adjacency matrices. As we will mainly deal with simple 
digraphs, they can be represented by Boolean matrices (we will come back to this in Sec. 
2.3). 1 Second, Chap. 7 generalizes graph constraints and application conditions using 
monadic second order logics. Good references on mathematical logics are [47] and [71]. 

First-order predicate calculus (more briefly, first order logic, FOL) generalizes prepo- 
sitional logic, which deals with propositions: A statement that is either true or false. 

1 Multidigraphs are also addressed using Boolean matrices. Refer to Sec. 7.5. 
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FOL formulas are constructed from individual constants (a, b, c, etc., typically lower- 
case letters from the beginning of the alphabet), individual variables (x, y, z, etc., typically 
lower-case letters from the end of the alphabet), predicate symbols (P, Q, R, etc., typically 
upper-case letters), function symbols (f, g, h, etc., typically lower-case letters from the 
middle of the alphabet), propositional connectives (—*, a, v, =>, •») and quantifiers (V, 
3). Set C will be that of individual constants, set T will be function symbols and set V will 
contain predicate symbols. Besides these elements, punctuation symbols are permitted 
such as parenthesis and commas. 

A formula in which every variable is quantified is a closed formula (open formula 
otherwise). A term (formula) that contains no variable is called ground term (ground 
formula). The arity of any predicate function / is its number of arguments, normally 
written as an upper index, /", if needed. 

The rules for constructing terms and formulas are recursive: Every element in C is a 
term, as it is any individual variable and also f n (ti, . . . , t n ), where f n e T and U are 
terms. Also, P e V is a formula 2 and the application of any propositional connective or 
quantifier (or both) to two or more predicates is also a formula. 

In fact, constants are formulas of arity zero so it would be convenient to omit them 
and allow formulas of any arity. Nevertheless we will follow the traditional exposition 
and use the term function when arity is at least 1. 

Example. □ As an example of FOL formula, one of the inference rules of predicate 
calculus is written: 

3xP(x) a VxQ(x) => 3x [P(x) a Q(x)] . 

It reads as if there exists x for which P and for all x Q, then there exists x for which 
P and Q. For another example, let's consider the language of ordered abelian groups. It 
has one constant 0, one unary function — , one binary function + and one binary relation 
«=. 

• 0, x, y are atomic terms. 

• +(x,y), +(x, +(y, —(z))) are terms, usually written in infix notation as x + y, x + 

(w + (-*))■ 

2 It is called atomic formula. 
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• = (+(x,y),0), ^ (+(x,+(y,—(z))),+(x,y)) are atomic formulas, usually written in 
infix notation as x + y = 0, x + y — z ^ x + y. 

• (Vx3y ^ (+(x,y), z)) a (3x = (+(x,y),0)) is a formula, more readable if written as 
(Vx3y x + y ^ z) a (3x a; + y = 0)M 

The semantics of our language depend on the domain of discourse (D) and on 
the interpretation function /. The domain of discourse (also known as universe of 
discourse) is the set of objects we use the FOL to talk about and must be fixed in 
advance. In the example above, for a fixed abelian group, the domain of discourse are 
the elements of the group. 

For a given domain of discourse D it is necessary to define an interpretation function / 
which assigns meanings to the non-logical vocabulary, i.e. maps symbols in our language 
onto the domain: 

• Constants are mapped onto objects in the domain. 

• 0-ary predicates are mapped onto true or false, i.e. whether they are true or false in 
this interpretation. 

• N-ary predicates are mapped onto sets of n-ary ordered tuples of elements of the 
domain, i.e. those tuples of members for which the predicate holds (for example, a 
1-ary predicate is mapped onto a subset of D). 

The interpretation of a formula / in our language is then given by this morphism 
/ together with an assignment of values to any free variables in /. If S is a variable 
assignment on / then we can write (I, S) |= / to mean that I satisfies / under the 
assignment S (/ is true under interpretation / and assignment S). Our interpretation 
function assigns denotations to constants in the language, while S assigns denotations to 
free variables. 

First-order predicate logic allows variables to range over atomic symbols in the domain 
but it does not allow variables to be bound to predicate symbols, however. A second 
order logic (such as second order predicate logic, [47]) does allow this, and sentences 
such as VP[P(2)] (all predicates apply to number 2) can be written. 

Example. □Starting out with formula: 



(3(X) = Var, y, z [(P(x, y) a P(x, z) => y = z) a (P(x, z) a P(y, z) => x = y)} 
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which expresses injectiveness of a binary relation P on its domain, it is possible to give 
a characterization of bijection (X) between two sets (Yi, Y 2 ): 

3X [f3(X) a V.t (FrOr) o 3yX(z, y)) a (Y 2 (x) 3yX(y,x))] . 

The bijection X is a binary relation and the sets Y\ and Y 2 are unary relations. Hence, 
Y\{x) is the same as x e Y\. See [23], pp. 319-320 for more details. 

Another example is the least upper bound (lub) property for sets of real numbers 
(every bounded, nonempty set of real numbers has a supremum): 

MA [(3w(w e A) a 3z\/w(w e A => w ^ z)) => 3xVy (Viu e A, (w ^ y) o x ^ y)] . 

m 

Second order logic (SOL) is more expressive than FOL under standard semantics: 
Quantifiers range over all sets or functions of the appropriate sort (thus, once the domain 
of the first order variables is established, the meaning of the remaining quantifiers is 
fixed). It is still possible to increase the order of the logic, for example by allowing 
predicates to accept arguments which are themselves predicates. 

Chapter 7 makes use of monadic second order logic, MSOL for short, 3 which lies 
in between first order and second order logics. Instead of allowing quantification over n- 
ary predicates, MSOL quantifies 0-ary and 1-ary predicates, i.e. individuals and subsets. 
There is no restriction on the arity of predicates. 

A theorem by Biichi and Elgot [7] [26] (see also [75]) states that string languages 
generated by MSOL formulas correspond to regular languages (see also Sec. 3.5), so we 
have an alternative to the use of regular expressions, appropriate to express patterns 
(this is one of the reasons to make use of them in Chap. 7). 4 Another reason is that 
properties as general as 3-colorability of a graph (see [23], Chap. 5 and also Sec. 7.1) can 
be encoded using MSOL so, for many purposes, it seems to be expressive enough. 

3 In the literature there are several equivalent contractions such as MS, MSO and M2L. 

4 See [52] for an introduction to monadic second order logic. See [29] for an implementation of 
a translator of MSOL formula into finite-state automata. 
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2.2 Category Theory 

Category theory was first introduced by S. Eilenberg and S. Mac Lane in the early forties 
in connection with their studies in homology theory (algebraic topology), see [25]. The 
reference book in category theory is [49]. There are also several very good surveys on 
this topic on the web such as www.cs.utwente.nl/~fokkinga/mmf92b.pdf. 

A category C is made up of a class 5 of objects, a class of morphisms and a binary 
operation called composition of morphisms, (Obj(C),Hom(C),o). Each morphism / has 
a unique source object and a unique target object, / : A — » B. There are two axioms for 
categories: 

1. if / : A — » B, g : B — » C and h : C — > D then h o (g o /) = (hog) o f (associativity). 

2. VX 31 x ■ X — > X such that V/ : A — » B it is true that 1 B o f = f = f o 1 A (existence 
of the identity morphism) . 

An object A is initial if and only if V-B 3!/ : A — > B, and terminal if VB 3lg : B — > A. 

Not all categories have initial or terminal objects, although if they exist then they are 
unique up to a unique isomorphism. 

ExamplenOnc first example is the category Set, where objects are sets and mor- 
phisms are total functions. Doing set theory in the categorical language forces to express 
everything with function composition only (no explicit arguments, membership, etcetera). 

Let's see that morphisms need not be functions. For example, any directed graph 
determines a category in which each node is one object and each directed edge is a 
morphism. Composition is concatenation of paths and the identity is the empty path. 
This category is at times called Path category. 

Similarly, any preordered set (A, ^) can be thought as a category. Objects are in this 
case the elements of A (a, b e A), and there is a morphism between two given elements 
whenever a ^ b. The identity is a ^ a. 6 

5 A class is a collection of sets, or other mathematical objects. A class that is not a set is called 
proper class and has the properties that it can not be an element of a set or a class and is 
not subject to the Zermelo-Fraenkel axioms, thereby avoiding some paradoxes from naive set 
theory. 

6 These three examples can be found in [28] . 
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The empty set 0 is the only initial object and every singleton object (one-element 
set) is terminal in category Set. If as before (A, ^) is a preordered set, A has an initial 
object if and only if it has a smallest element, and a terminal object if and only if A has 
a largest element. In the category of graphs (to be defined soon) the null graph - the 
graph without nodes and edges - is an initial object. The graph with a single node and 
a single edge is terminal, except in the category of simple graphs without loops which 
does not have a terminal object. ■ 

Example. nA multigraph G = (V, E, s, t) consists of a set V of vertices and a set E 
of edges. Functions source and target s,t : E — > V respectively return the initial node 
and the final node of an edge. 

A graph morphism / : G\ — > G2, with / = (fv,fE), consists of two functions 
fv ■ V\ — » V 2 and : E\ — > E 2 such that fv 0 s\ = s 2 o f E and fv ° ti = t 2 ° Je- 
Composition is defined componentwise, i.e. given /1 : G\ — > G 2 and f 2 : G 2 — » G3 then 
h 0 h = (f2,v 0 fi,v, h,E 0 Ji,e) ■ Gi G 3 . 

The category of graphs with total morphisms will be denoted Graph and Graph p 
if morphisms arc allowed to be partial. Graph p will be more interesting for us.B 

Let C and T> be two categories. A functor F : C — > T> is a mapping 7 that associates 
objects in C with objects in V (for some X e C, F(X) e V) and morphisms in C with 
morphisms in T>: 

f:X^YJeC, F(f) : F(X) - F(Y), F(f) e V. (2.1) 

Any functor has to keep the category structure (identities and composition), i.e. it must 
satisfy the following two properties: 

1. VAeC, F(l x ) = l F(x) . 

2. V/ : X -> Y, g : Y -> Z we have that F(g o /) = F(g) o F(f). 

Example. oThe constant functor between categories C and T> sends every object in 
C to a fixed object in V. The diagonal functor is defined between categories C and C v 
and sends each object in C to the constant functor in that object. 8 Let C denote the 
category of vector spaces over a fixed field, then the tensor product V (x) W defines a 
functor CxC^CI 

7 Functors can be seen as morphisms between categories. 

8 C° is the class of all morphisms from T> to C 
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C U 9 >Y 

F(U) < - X - > F(Y) 

Fig. 2.1. Universal Property 

All constructions that follow can be characterized by some abstract property that de- 
mands, under some conditions, the existence of a unique morphism, known as universal 
properties. 

One concept constantly used is that of universal morphism, which can be easily 
recognized in the rest of the section: Let F : C — » T> be a functor and let X e V, a 
universal morphism from X to F — where U e C and u : X — > F(U) - is the pair ([/, it) 
such that VFe C and V/: X -> F(y), 3!g : [/ — > F satisfying: 9 

/ = n<?) ° «■ 

See Fig. 2.1 where blue dotted arrows delimit the commutative triangle (u, /, F(g)). 



F 

T 




Fig. 2.2. Product, Cone and Universal Cone 



In fact, this is a universal property for universal morphisms. 
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The product of objects X and Y is an object P and two morphisms 27 x ■ P — * X 
and Piy ■ P — > Y such that P is terminal. This definition can be extended easily to an 
arbitrary collection of objects. 

A cone from N e V to functor F : C — » T> is the family of morphisms jx ■ N — > F(X) 
such that V/ : J -> F, / 6 C we have 07^ = 7^. 

A limit is a universal cone, i.e. a cone through which all other cones factor: A cone 
(L,6x) of a functor F : C — > I? is a limit of that functor if and only if for any cone 
(A, 7x) of F, 3\u : N — » L such that 7x = 0 w (L is terminal). See Fig. 2.2. 
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Fig. 2.3. Pushout and Pullback 



A pullback 10 is the limit of a diagram 11 consisting of two morphisms / : X — > Z 
and g : y — > Z with a common codomain. 

By reverting all arrows in previous definitions 12 we get the dual concepts: Coprod- 
uct, cocone, colimit and pushout. A pushout 13 is the colimit of a diagram consisting 
of two morphisms / : X — > Y and g : X — > Z with a common domain and can be 
informally interpreted as closing the square represented on the left of Fig. 2.3 by defining 
the red dashed morphisms jz and yy. Fine blue dotted morphisms (5y, 8z and 5po) 

10 Also known as fibered product or cartesian square. 

11 Informally, the diagram is the one appearing on the left of Fig. 2.3. Formally, a diagram of 
type / - the index or scheme category - in category C is a functor D : I — > C. What objects 
and morphisms are in I is irrelevant. Only the way in which they are related is of importance. 

12 Reverting arrows is at times called duality. 

13 Also known as fibered coproducts or fibered sums. 
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illustrate the universal property of PO of being the initial object. We will see in Sees. 3.1 
and 3.2 that the basic pillars of categorical approaches to graph transformation are the 
pushout and pullback diagrams depicted in Fig. 2.3. 

Pushout constructions are very important to graph transformation systems, in par- 
ticular to SPO and DPO approaches, but also used to some extent by most of the rest 
of the categorical approaches. The intuition of a pushout between sets A, B and C as in 
Fig. 2.4 is to glue sets B and C through set A or, in other words, put C where A is in B. 
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Fig. 2.4. Pushout as Gluing of Sets 



A pushout complement is a categorical construction very similar to PO and PB. 
In this case, following the notation on the left of Fig. 2.3, / and would be given and 
g, jz and Z need to be defined. 

Roughly speaking, an initial pushout is an initial object in the "category of 
pushouts". 14 Supose we have a pushout as in the left of Fig. 2.3, then it is said to 
be initial over 7y if for every pushout /' : X' — » Y and j'z '■ % — > PO (refer to Fig. 2.5) 
there exist unique morphisms / : X — * X' and Yz : Z —* Z' such that: 

1- / = /' ° / and lz = i z ° lz- 

2. The square defined by over lined morphisms (/, g,7r,7z) is a pushout. 



Initial pushouts are needed for the gluing condition and to define HLR categories. 
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Fig. 2.5. Initial Pushout 



Now we will introduce adhesive HLR categories 15 which are very important for a 
general study of graph grammars and graph transformation systems. See Sec. 3.2 for an 
introduction or refer to [22] for a detailed account. 

Van Kampen squares arc pushout diagrams closed in some sense under pullbacks. 
Given the pushout diagram (p,m,p* ,m*) on the floor of the cube in Fig. 2.6 and the 
two pullbacks (m,g',m',l r ) and (p,r',p',l') of the back faces (depicted in dashed red) 
then the front faces (p*,h',p'*,g') and (m*,h',m'*,r') (depicted in dashed blue) are 
pullbacks if and only if the top square (p',m',p'*,m'*) is a pushout. Even in category 
Set not all pushouts are van Kampen squares, unless the pushout is defined along a 
monomorphism (an injective morphism). We say that (p, m,p*,m*) is defined along a 
monomorphism if p is injective (symmetrically, if m is injective). A category has pushouts 
along monomorphisms if at least one of the given morphism is a monomorphism. 

We will be interested in so-called adhesive categories. A category C is called adhesive 
if it fulfills the following properties: 

1. C has pushouts along monomorphisms. 

2. C has pullbacks. 

3. Pushouts along monomorphisms are van Kampen squares. 

There are important categories that turn out to be adhesive categories but others are 
not. For example, Set and Graph arc adhesive categories but Poset (the category of 
partial ordered sets) and Top (topological spaces and continuous functions) are not. 

15 HLR stands for High Level Replacement. 



2.2 Category Theory 25 




Fig. 2.6. Van Kampen Square 



Axioms of adhesive categories have to be weakened because there are important cat- 
egories for graph transformation that do not fulfill them as, e.g. typed attributed graphs. 
The main difference between adhesive categories and adhesive HLR categories is that 
adhesive properties are demanded for some subclass M of monomorphisms and not for 
every monomorphism. A category C with a set of morphisms M. is an adhesive HLR 
category if: 

1. M. is closed under isomorphism composition and decomposition (go f e Ai, g e M => 
feM). 

2. C has pushouts and pullbacks along .M-morphisms and M-morphisms are closed 
under pushouts and pullbacks. 

3. Pushouts in C along .M-morphisms arc van Kampen squares. 

Symmetrically to previous use of the term along, a pushout along an .M-morphism is 
a pushout where at least one of the given morphisms is in A4. 

Among others, category PTNets (place/transition nets) fails to be an adhesive HLR 
category so it would be nice to still consider wider sets of graph grammars by further 
relaxing the axiomatic of adhesive HLR categories. In particular third axiom can be 
weakened if only some cubes in Fig. 2.6 are considered for the van Kampen property. In 
this case we will speak of weak adhesive HLR categories: 
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3'. Pushouts in C along A4-morphisms are weak van Kampen squares, i.e. the van Kam- 
pen square property holds for all commutative cubes with p e Ai and m e M. or 
p e M and V, r', g' e M. 

Adhesive HLR categories enjoy many nice properties concerning push-out and pull- 
back constructions, allowing to move forwards and backwards easily in diagrams. Assum- 
ing all involved morphisms to be in M.: 

1. Pushouts along .M-morphisms are pullbacks. 

2. If a pushout is the composition of two squares in which the second is a pullback, then 
in fact both squares are pushouts and pullbacks. 

3. The symmetrical van Kampen property for pullbacks also holds (see Fig. 2.6): If 
the top square (G', H', R', V) is a pullback and the front squares (£?', G, H, H') and 
(H 1 , H, R, R 1 ) are pushouts, then the bottom (G, H, R, L) is a pullback if and only if 
the back faces (G', G, L, L') and (L 1 , L, R, R') are pushouts. 

4. Pushout complements are unique up to isomorphisms. 

It is necessary to be cautious when porting concepts to (weak) adhesive categories 
as morphisms involved in the definitions and theorems have to belong to the set of 
morphisms A4. 



2.3 Graph Theory 

In this section simple digraphs are defined, which can be represented as boolean matrices. 
Besides, basic operations on these matrices are introduced. They will be used in later 
sections to characterize graph transformation rules. Also, compatibility for a graph 16 - 
an adjacency matrix and a vector of nodes - is defined and studied. This paves the way 
to the notion of compatibility of grammar rules 17 and of sequences of productions. 18 

Graph theory is considered to start with Euler's paper on the seven bridges of 
Konisbcrg in 1736. Since then, there has been an intense research in the field by, among 
others, Cayley, Silvester, Tait, Ramsey, Erdos, Szemeredy and many more. Nowadays 

16 See definition 2.3.2. 

17 See definition 4.1.5. 

18 See Sec. 4.5. 
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graph theory is applied to a wide range of areas in different disciplines in both science 
and engineering, from computer science, chemistry, physics, topology, etcetera. Among 
its main branches we can cite extremal graph theory, geometric graph theory, algebraic 
graph theory, probabilistic (also known as random) graph theory and topological graph 
theory. We will just use some basic facts from algebraic graph theory. 

The category of graphs has been introduced in Sec. 2.2. An easy way to define a 
simple digraph G = (V, E) is as the structure that consists of two sets, one of nodes 
V = {Vi I i e 1} and one of edges E = {{Vi, Vj) eVxV} (think of arrows as connecting 
nodes). 19 Prefix di means that edges are directed and term simple that at most one arrow 
is allowed between the same two nodes. For example, the complete simple digraph with 
three vertices and two examples of four and five vertices can be found in Fig. 2.7. 



Any simple digraph G is uniquely determined through one of its associated matrices, 
known as adjacency matrix Ac, whose element is defined to be one if there exists 
an arrow joining vertex i with vertex j, and zero otherwise. This is not the only possible 
characterization of graphs using matrices. 

The incidence matrix is an m x n matrix I™, where m is the number of nodes and 
n the number of edges, 20 such that 7j = — 1 if edge ej leaves the node and = 1 if 
edge ej enters the node (Jj = 0 otherwise). As it is possible to relate the adjacency and 

19 Mind the difference between this and having functions s and t, see for example [22]. 

20 The tensor notation is explained in Sec. 2.4. 




Fig. 2.7. Three, Four and Five Nodes Simple Digraphs 



28 2 Background and Theory 



incidence matrices through line graphs, we will mainly characterize graphs through their 
adjacency matrices. 21 

In addition, a vector that we call node vector Vq is associated to our digraph G, 
with its elements equal to one if the corresponding node is in G and zero otherwise. It 
will be necessary because we will study sequences of productions, which probably apply 
to different graphs. Their adjacency matrices will then refer to different sets of nodes. 
In order to operate algebraically we will complete all matrices. 22 Node vectors are used 
to distinguish which nodes belong to the graph and which ones have been added for 
algebraic operation consistency. Next example illustrates this point. 

Example. nThe adjacency matrices A E and C E for first and third graphs of Fig. 2.7 

arc: 
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where A N and C N are the corresponding node vectors. A vertically separated column 
indicates node ordering, which applies both to rows and columns. Note that edges incident 
to node 4 are considered in matrix A E . As there is no node 4 in A, corresponding elements 
in the adjacency matrix are zero. To clearly state that this node does not belong to graph 
A we have a zero in the fourth position of A N M 

Note that simple graphs (without orientation on edges) can be studied if we limit 
to the subspace of symmetric adjacency matrices. In Sec. 7.5 we study how to extend 
Matrix Graph Grammars approach to consider multigraphs. The difference between a 
simple graph and a multigraph is that simple graphs allow a maximum of one edge 

21 The line graph L(G) of graph G is a graph in which each vertex of L(G) represents an edge of 
G and two nodes in L(G) are incident if the corresponding edges share an endpoint. Incidence 
and adjacency matrices are related throught the equation: 

A(L(G)) = B{G?B{G) - 21 

where A(L(G)) is the adjacency matrix of L(G), B(G) its incidence matrix and I the identity 
matrix. 

22 Refer to Sec. 4.2. 
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connecting two nodes in each direction, while a multigraph allows a finite number of 
them. 

In the literature, depending mainly on the book, there is some confusion with termi- 
nology. At times, the term graph applies to multigraphs while other times graph refers 
to simple graphs (also known as relational graphs). Whenever found in this book, and 
unless otherwise stated, the term graph should be understood as simple digraph. 

The basic Boolean operations on graphs are defined componentwise on their adjacency 
matrices. Let G and H be two graphs with adjacency matrices (5*) and (ft}), i,j e 
{!,... n}, then: 



Similarly to ordinary matrix product based on addition and multiplication by scalars, 
there is a natural definition for a boolean product with the same structure but using 
boolean operations and and or. 

Definition 2.3.1 (Boolean Matrix Product) For digraphs G and H, let Mg = 



Element in the boolean product matrix is one if there exists an edge joining 

node i in digraph G with some node k in the same digraph and another edge in digraph 
H starting in k and ending in j. The value will be zero otherwise. 

If for example we want to check whether node j is reachable starting in node j in n 

fe) 

steps or less, we may calculate Vfc=i A^ k \ where = A© ■ ■ ■ ©A, and test if element 
is one. 23 We will consider square matrices only as every node can be either initial 
or terminal for any edge. 

Another useful product operation that can be defined for two simple digraphs G\ and 
G2 is its tensor product (defined in Sec. 2.4) G = G\ ©G2: 

23 In order to distinguish when we are using the standard or boolean product, in the latter 
exponents will be enclosed between brackets. 



GvH=(g*vh<) GAH=(g i j Ahi) G = . 




n 




(2.2) 
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1. The nodes set is the cartesian product V(G) = V{G\) x V(G2)- 

2. Two vertices u\ (x) w 2 and v\ (x) v-i are adjacent if and only if u\ is adjacent to v\ in 
G\ and ui is adjacent to V2 in G 2 . 

In Sec. 2.4 we will see that the adjacency matrix of G coincide with the tensor product 
of the adjacency matrices of G\ and G 2 . 

The following definition, definition 2.3.3, proposition 2.3.4 and the introduction above 
of the nodes vector is not standard in graph theory (in fact, as far as we know, we are 
introducing them). The decission of including them in this introductory section is because 
they are simple results very close with what one understands as "basics" of a theory. 

Given an adjacency matrix and a vector of nodes, a natural question is whether they 
define a simple digraph or not. 

Definition 2.3.2 (Compatibility) A boolean matrix M and a vector of nodes N are 
compatible if they define a simple digraph: No edge is incident to any node that does not 
belong to the digraph. 

An edge incident to some node which does not belong to the graph (has a zero in the 
corresponding position of the nodes vector) is called a dangling edge. 

In the DPO/SPO approaches, this condition is checked when building a direct deriva- 
tion, known as dangling condition. The idea behind it is to obtain a closed set of entities, 
i.e. deletion of nodes outputs a digraph again (every edge is incident to some node). 
Proposition 2.3.4 below provides a criteria for testing compatibility for simple digraphs. 

Definition 2.3.3 (Norm of a Boolean Vector) Let N = (v\, . . . ,v n ) be a boolean 
vector. Its norm || is given by: 



i=l 

Proposition 2.3.4 A pair (M,N), where M is an adjacency matrix and N a vector of 
nodes, is compatible if and only if 



n 




(2.3) 




(2.4) 



where t denotes transposition. 
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Proof 

□In an adjacency matrix, row i represents outgoing edges from vertex i, while column j 
are incoming edges to vertex j. Moreover, (M) ik a (N) k = 1 if and only if (M) ik = 1 
and (N) k = 0, and thus the i — th element of vector M 0 N is one if and only if there is 
a dangling edge in row number i. We have just considered outgoing edges; for incoming 
ones we have a very similar term: M* 0 N. To finish the sufficient part of the proof - 
necessity is straightforward - we or both terms and take norms to detect if there is a 

i.a 

Remark. n\Ve have used in the proof of proposition 2.3.4 distribution of 0 and v, 
(Mi v M 2 ) 0M 3 = (Mi 0 M 3 ) v (M 2 0 M 3 ). In addition, we also have the distributive 
law on the left, i.e. M 3 0(Mi v M 2 ) = (Ms 0 Mi) v (M 3 0 M 2 ). Besides, it will be stated 
without proof that || uj\ v = II Hi v ll^li-B 

In Chap. 5 we will deal with matching, i.e. finding the left hand side of a graph 
grammar rule in the initial state (host graph). A matching algorithm is not proposed; 
our approach assumes that such algorithm is given. This is closely related to the well 
known graph-subgraph isomorphism problem (SI) which is an NP-complete decision 
problem if the number of nodes in the subgraph is strictly smaller than the number of 
nodes in the graph. We will brush over complexity in Chap. 9.3. 

2.4 Tensor Algebra 

Throughout the book, quantities that can be represented by a letter with subscripts or 
superscripts attached 24 will be used, together with some algebraic structure (tensorial 
structure). This section is devoted to a quick introduction to this topic. Two very good 
references are [33] (with relations to physics) and the almost classic book [72]. 

A tensor is a multilinear application between vector spaces. It is at times interesting 
to stay at a more abstract level and think of a tensor as a system that fulfills certain 
notational properties. Systems can be heterogeneous when there are different types of 
elements, but we will only consider homogeneous systems. Therefore we will speak of 
systems or tensors, it does not matter which. 

24 For example, Aj k 
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The rank 25 of a system (tensor) is the number of indices it has, taking into account 
whether they are superscripts or subscripts. For example, A l - k is [^j-valent or of rank 
(1,2). Subscripts or superscripts are referred to as indices or suffixes. 

Algebraic operations of addition and substraction apply to systems of the same type 
and order. They are defined componentwise, e.g. Cj k = A l - k + B l - k , provided that some 
additive structure is defined on elements of the system. We do not follow the Einstein 
summation convention, which states that when an index appears twice, one in an upper 
and one in a lower position, then they are summed up over all its possible values. 

The product is obtained multiplying each component of the first system with each 
component of the second system, e.g. C™ ni = A l -®B mnl . Such a product is called outer 
product or tensor product. The order of the result is the sum of the orders of the 
factors and inherits all the indices of its factors. All linear relations are satisfied, i.e. for 
Vi, V2 6 V, w e W and v ®w eV ®W the following identities are fulfilled: 



1. (vi + V2) ® w = Vi ®w + V2 ®w. 

2. cv (x) w = v (x) cw = c(v (x) to). 

To categorically characterize tensor products note that there is a natural isomorphism 
between all bilinear maps from E x F to G and all linear maps from E (x) F to G. E®F 
has all and only the relations that are necessary to ensure that a homomorphism from 
E (x) F to G will be linear (this is a universal property). For vector spaces this is quite 
straightforward, but in the case of i?-modules (modules over a ring R) this is normally 
accomplished by taking the quotient with respect to appropriate submodules. 

Example. nThc Kronecker product is a special case of tensor product that we will 
use in Chap. 8. Given matrices A = (al 1 ) and B = (b l ?) , it is defined to be 



C = A®B = (c)) 



j jmpxnq 



where 



(2.5) 



being i = (i 1 — l)n + i 2 and j = (ji — l)m + j 2 . The notation A = ( a j) mxn denotes a 
matrix with m rows and n columns, i.e. i e {1, . . . , m} and j e {1, . . . , n}. As an example: 



B 



1x2 



b\ b\ 



b\ b\ 



C = A®B 



2x2 



a\b\ a\b\ a\b\ a\b\ 
a\b\ a\b\ a\b\ a\b\ 



The terms order and valence are commonly used as synonyms. 
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Note that the Kronecker product of the adjacency matrices of two graphs is the 
adjacency matrix of the tensor product graph (see Sec. 2.3 for its definition). ■ 

The operation of contraction happens when an upper and a lower indices are set 
equal and summed up, e.g. Cj m ™' .— > C mnl = Cj mnl = Xi=j C} mnl . For example, 

the standard multiplication of a vector by a matrix is a contraction: Consider matrix A* 
and vector v k with i, j, fee {1, . . . , n}, then matrix multiplication can be performed by 
making j and k equal and summing up, u % = Ylj=i ■ 

The inner product is represented by < • , • ) and is obtained in two steps: 

1. Take the outer product of the tensors. 

2. Perform a contraction on two of its indices. 

In Sec. 2.5 we will extend this notation to cope with graph grammar rules representation. 

Upper indices are called contravariant and lower indices covariant. Contravariance 
is associated to the tangent bundle (tangent space) of a variety and corresponds, so to 
speak, to columns. Covariance is the dual notion and is associated to the cotangent bundle 
(normal space) and rows. As an example, if we have a vector V in a three dimensional 
space with basis {E 1 ,E 2 ,E 3 } then it can be represented in the form A = a 1 Ex + a?E 2 + 
a 3 E 3 . Components a 1 can be calculated via a 1 = (A, E 1 *) with (E l , Ej) = <5], where the 
Kronecker delta function is 1 if i = j and zero if i # j. Basis {E 1 } and {Ei} are called 
reciprocal or dual. 

We will not enter the representation of S in integral form or the relation with the Dirac 
delta function, of fundamental importance in distribution theory, functional analysis (see 
Sec. 2.5) and quantum mechanics. The Kronecker delta can be generalized to an ["]- 
valent tensor: 

fe=i 

Besides the kronecker delta, there are other very useful tensors such as the metric 
tensor, which can be informally introduced by Ei = E^ and gijE^ = Ei. Note that 
g raises or lowers indices, thus moving from covariance to contravariance and viceversa. 
Related to S and to group theory is the important Levi-Civita symbol: 
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+ 1 if cr is an even permutation. 
— 1 if cr is an odd permutation. 
0 otherwise. 



(2.7) 



where a = (i\ 



is a permutation of (1 



See Sec. 2.6 for definitions and 



further results. Symbols S and e can be related through matrix A = (a^i) 



5 ikjl and: 



(2.8) 



2.5 Functional Analysis 

Functional analysis is a branch of mathematics focused on the study of functions - oper- 
ators - in infinite dimensional spaces (although its results also apply to finite dimensional 
spaces). Besides the algebraic structure (normally a vector space but at times groups) 
some other ingredient is added such as an inner product (Hilbert spaces), a norm (Banach 
spaces) a metric (metric spaces) or just a topology (topological vector spaces). 

An operator is just a function, but the term is normally employed to call attention to 
some special aspect. Examples of operators in mathematics are differential and integral 
operators, linear operators (linear transformations), Fourier transform, etcetera. 

In this book we will call operators to functions that act on functions with image a 
function. Operators will be used, e.g. in Chap. 5 to modify productions in order to get a 
production or a sequence of productions. 

We will need to change productions as commented above and our inspiration comes 
from operator theory and functional analysis, but we would like to put it forward in a 
quantum mechanics style. So, although it will not be used as it is, we will give a very 
brief introduction to Hilbert and Banach spaces, bra-ket notation and duality. 

A Hilbert space H is a vector space, complete with respect to Cauchy sequences over 
a field K (every Cauchy sequence has a limit in H), plus a scalar (or inner) product. 26 

26 Inner product <■,■): H x Ti — > K axioms are: 

1. Vx,y e H,(x,y) = (y,x). 

2. Va,6 e K, Vx,y e Ti., (ax, by) = a(x,y) + b(x,y). 

3. Va; e Ti, (x, x} ^ 0 and (x, x} = 0 if and only if x = 0. 
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Completeness ensures that the limit of a convergent sequence is in the space, facilitating 
several definitions from analysis (note that a Hilbert space can be infinite-dimensional). 
The inner product - (u, v), u, v e TL - equips the structure with the notions of distance 
and angle (in particular perpendicularity). From a geometric point of view, the scalar 
product can be interpreted as a projection whereas analytically it can be seen as an 
integral. 

The inner product gives raise to a norm 27 || -|| via ||a;|| 2 = (x,x}, \jx e TL. Any norm 
can be interpreted as a measure of the size of elements in the vector space. Every inner 
product defines a norm but, in general, the opposite is not true, i.e. norm is a weaker 
concept than scalar product. 

The relationship between row and column vectors can be generalized from an abstract 
point of view through dual spaces. The dual space TL* of a Hilbert space TL over the 
field K has as elements x* e TL* , linear applications with domain (initial set) TL and 
codomain (image) the underlying field K, x* : TL — > K. 

The dual space becomes a vector space defining the addition Vx*, x | e TL*, x e TL by 
(x* + x*){x) = x*(x) +x%(x) and the scalar product Vfc e K by kx*(x) = x*(kx). Using 
tensor algebra terminology (see Sec. 2.4) elements of TL are called covariant and elements 
of TL* contravariant. Note how in (x,y) it is possible to think of x as an element of the 
vector space and y as an element of the dual space. 

Any Hilbert space is isomorphic (or anti-isomorphic) to its dual space, H = H* , 
which is the content of the Riesz representation theorem. This is particularly relevant to 
us because it is a justification of the Dirac bra-ket notation that we will also use. 

The Riesz representation theorem can be stated in the following terms: Let TL be 
a Hilbert space, TL* its dual and define 4> x {y) = (,x,y}, <j> e TL* . Then, the mapping 
<P : TL — > TL* such that x >-> <fi x is an isometric isomorphism. This means that # is a 
bijection and that \\x\\ = \\<fi x \\- 

27 Norm || ■ || : B — > K axioms are: 

1. Vx,y e B, \\x + y\\ ^ ||x|| + \\y\\. 

2. Va e K,Vx e B, \\ax\\ = \a\ ■ \\x\\. 

3. Vx 6 B, \\x\\ ^ 0 and = 0 if and only if x = 0. 
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We will very briefly introduce Banach spaces to illustrate how notions and ideas from 
Hilbert spaces, specially notation, is extended in a more or less natural way. 

A complete 28 vector space plus a norm is known as a Banach space, B. Associated 
to any Banach space there exists its dual space, £>*, defined as before. Contrary to Hilbert 
spaces, a Banach space is not isometrically isomorphic to its dual space. 

It is possible to define a distance (also called metric) out of a norm: d(x,y) = 
\\x — y\\. Even though there is no such geometrical intuition of projection nor angles, it 
is still possible to use the notation we are interested in. Given x e B, x* e £>*, instead 
of writing x* (x) (the result is an element of /C) at times (x, x*~) is preferred. Although 
the space and its dual live at different levels, we would like to recover this geometrical 
intuition of projection. In some (very nice) sense, the result of x* (x) is the projection of 
x over x*. 

The same applies for an operator T acting on a Banach space B,T : B — > B. Suppose 
/, g e B, then g = T (/) = </, T). This is closer to our situation, so the application of a 
production 29 can be written 

R = (L,p). (2.9) 

The left part is sometimes called bra and the right part ket: (bra, kef). 

Besides dual elements, the adjoint of an operator is also represented using asterisks. 
In our case, the adjoint operator of T, represented by T*, is formally defined by the 
identity: 

(L,Tp) = (T*L,p). (2.10) 

Roughly speaking, T is an operator (a function) that modifies a production, being its 
output a production again, so the left hand side in (2.10) is equivalent to T (p) (L) , and 
the right hand side is just p (T*L). Note that T (p) is a production and T*L is a simple 
digraph. 

In quantum mechanics the possible states of a quantum mechanical system are repre- 
sented by unit vectors - state vectors - in a Hilbert space Ti. or state space (equivalcntly, 
points in a projective Hilbert space). Each observable - property of the system - is de- 
fined by a linear operator acting on the elements of the state space. Each eigenstate of an 

28 Complete in the same sense as for Hilbert spaces 

29 See Sec. 4.1 for definitions. 
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observable corresponds to an eigenvector of the operator and the eigenvalue to the value 
of the observable in that eigenstate. An interpretation of <V| 0) is the probability ampli- 
tude for the state ip to collapse into the state <fi, i.e. the projection of ip over (f>. In this 
case, the notation can be generalized to metric spaces, topological vector spaces and even 
vector spaces without any topology (close to our case as we will deal with graphs with- 
out introducing notions such as metrics, scalar products, etcetera). Two recommended 
references are [37] and [65]. 

This disgression on quantum mechanics is justified because along the present con- 
tribution we would like to think in graph grammars as having a static definition which 
provokes a dynamic behaviour and the duality between state and observable. Besides, 
the use of the notation, we would like to keep some "physical" (mechanics) intuition 
whenever possible. 

2.6 Group Theory 

One way to introuce group theory is to define it as the part of mathematics that study 
those structures for which the equation a ■ x = b has a unique solution. There is a very 
nice definition due to James Newman [56] that I'd like to reproduce: 

The theory of groups is a branch of mathematics in which one docs something to 
something and then compares the results with the result of doing the same thing 
to something else, or something else to the same thing. 

We will be interested in groups, mainly in its notation and basic results, when dealing 
with scqucntialization in Chaps. 4 and 6. A group G is a set together with an operation 
(G, •) that satisfies the following axioms: 

1. Closure: Va, b e G, a • b e G. 

2. Associativity: Va, b, c e G, a ■ (b ■ c) = (a ■ b) ■ c. 

3. Identity element: 3e e G such that a ■ e = e ■ a = a. 

4. Inverse element: Va e G 36 6 G such that a ■ b = e = b ■ a. 

Actually, the third and fourth axioms can be weakened as only one identity per axiom 
should suffice, but we think it is worth stressing the fact that if they exist then they work 
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on both sides. Normally, the inverse element of a is written a -1 . At times the identity 
element is represented by 1q or Og, depending on the notation (abelian or non-abelian) . 
A group is called abelian or commutative if Va, b e G, a • b = b ■ a. 

A group S inside a group G is called a subgroup. If this is the case, we need S to 
be closed under the group operation, it also must have the identity element e and every 
element in S must have an inverse in S. If S cz G and Va, b e S we have that a ■ b^ 1 e S 
then S is a subgroup. Lagrange's theorem states that the order of a subgroup (number 
of elements) necessarily divides the order of the group. 

We are almost exclusively interested in groups of permutations: For a given sorted 
set, a change of order is called a permutation. This does not reduce the scope because, 
by Cayley's theorem, every group is isomorphic to some group of permutations. 

A transposition is a permutation that exchanges the position of two elements whilst 
leaving all other objects unmoved. It is known that any permutation is equivalent to a 
product of transpositions. Furthermore, if a permutation can result from an odd number 
of transpositions then it can not result from and even number of permutations, and 
viceversa. A permutation is even if it can be produced by an even number of exchanges 
and odd in the other case. This is called parity. 

The signature of a permutation a, sgn(a), is +1 if the permutation is even and — 1 
if it is odd. This is the Levi-Civita symbol as introduced in Sec. 2.4 if it is extended for 
non-injectivc maps with value zero. 

Any permutation can be decomposed into cycles. A cycle is a closed chain inside a 
permutation (so it is a permutation itself) which enjoy some nice properties among which 
we highlight: 

• Cycles inside a permutation can be chosen to be disjoint. 

• Disjoint cycles commute. 

Any permutation can be written as a two row matrix where the first row represents the 
original ordering of elements and the second the order once the permutation is applied. 
Example. nThe permutation a can be decomposed into the product of three cycles: 



12 3 4 5 6 7 8 
3 5 7 8 2 4 16 



(1 3 7) (2 5)(4 8 6). 
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Note that this decomposition is not unique because any decomposition into transpo- 
sitions would do (and there are infinitely many).B 

If the permutation turns out to be a cycle, then a clearer notation can be used: Write 
in a row, in order, the following element in the permutation. In the example above we 
begin with 1 and note that 1 goes to 3, which goes to 7, which goes back to 1 and hence 
it is written (1 3 7). 

A cycle with an even number of elements is an odd permutation and a cycle with 
an odd number of elements is an even permutation. In practice, in order to determine 
whether a given permutation is even or odd, one writes the permutation as a product of 
disjoint cycles: The permutation is odd if and only if this factorization contains an odd 
number of even-length cycles. 

2.7 Summary and Conclusions 

In this chapter we have quickly reviewed some basic facts of mathematics that will be 
used throughout the rest of the book: The basics of first order, second order and monadic 
second order logics, some constructions of category theory such as pushouts and pullbacks 
together with the introduction of some categories useful for future chapters, graph theory 
basic definitions and compatibility, tensor algebra and functional analysis notations and 
some basic group theory, paying some attention to permutations. 

Internet is full of very good web pages introducing these branches of mathematics 
with deeper explanations and plenty of examples. It is not possible to give an exhaustive 
list of all webpages visited to make this chapter. Nevertheless, I would like to highlight 
the very good job being performed by the community at http://planetmath.org/ and 
http://www.wikipedia.Org/.from where many examples presented 

Next chapter summarizes current approaches to graph grammars and graph trans- 
formation systems, so it is still introductory. We will put our hands on Matrix Graph 
Grammars in Chap. 4. 
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Graph Grammars Approaches 



Before moving to Matrix Graph Grammars it is necessary to take a look at other ap- 
proaches to graph transformation to "get the taste" , which is the aim of this chapter. 
We will see the basic foundations leaving comparisons of more advanced topics (like 
application conditions) to sporadic remarks in future chapters. 

Sections 3.1 and 3.2 are devoted to categorical approaches, probably the most devel- 
oped formalitations of graph grammars. On the theoretical side, very nice ideas have put 
at our disposal the possibility of using category theory and its generalization power to 
study graph grammars, but even more so, a big effort has been undertaken in order to fill 
the gap between category theory and practice with tools such as AGG (see [22]). Please, 
refer to [1] for a detailed discussion and comparison of tools. 

In Sees. 3.3 and 3.4 two completely different formalisms to the categorical approach 
are summarized, at times called set-theoretic or even algorithmic approaches. They are 
in some sense closer to implementation than those using category theory. There has been 
a lot of research in these two essential approaches so unfortunately we will just scratch 
the surface. 

Interestingly it is possible to study graph transformation using logics, providing us 
with all powerful methods from this branch of mathematics, monadic second order logics 
in particular. We will brush over this brilliant approach in Sec. 3.5. 

To finish this review we will briefly touch on the very interesting relation- algebraic 
approach in Sec. 3.6, which has not attracted as much attention as one should expect. 
Finally, the chapter is closed with a summary in Sec. 3.7. 
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In this chapter we abuse of bold letters with the intention of facilitating the search 
of some definition or result. It is assumed that this chapter as well as Chap. 2 will be 
mainly used for reference. 

3.1 Double PushOut (DPO) 
3.1.1 Basics 

In the DPO approach to graph rewriting, a direct derivation is represented by a double 
pushout in category Graph (multigraphs and total graph morphisms). Productions can 
be defined as three graph components, separating the elements that should be preserved 
from the left and right hand sides of the rule. 

A production p : [L < — K — > R) consists of a production name p and a pair of 
injective graph morphisms I: K — » L and r: K — > R. Graphs L, R and K are respectively 
called the left-hand side (LHS), right-hand side (RHS) and the interface of p. Morphisms 
I and r are usually injective and can be taken to be inclusions without loss of generality. 
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Fig. 3.1. Example of Simple DPO Production 



The interface K of a production consists of the elements that should be preserved by 
the production application, while elements in L — K are deleted and elements of R — K 
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are added. Figure 3.1 shows a simple DPO production named del, that can be applied 
if a path of three nodes is found. If so, the production eliminates the last node and all 
edges and creates a loop edge in the second node. 

A direct derivation can be defined as an application of a production to a graph 
through a match by constructing two pushouts. A match is a total morphism from the 
left hand side of the production onto the host graph, i.e. it is the operation of finding 
the LHS of the grammar rule in the host graph. Thus, given a graph G, a production p 
: (L < — K — > R) and a match to : L — > G, a direct derivation from G to H using p 
(based on to) exists if and only if the diagram in Fig. 3.2 can be constructed, where both 
squares are required to be pushouts in category Graph. 

In Fig. 3.2, red dotted arrows represent the morphisms that must be defined in order 
to close the diagram, i.e. to construct the pushouts. D is called the context graph. In 
particular, if the context graph can not be constructed then the rule can not be applied. 

A direct derivation is written G ^=^> H or simply G H if the production and the 
matching are known from context. 
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Fig. 3.2. Direct Derivation as DPO Construction 



Figure 3.1 shows the application of rule del to a graph. Morphisms to, d and to* are 
depicted by showing the correspondence of the vertices in the production and the graph. 

In order to apply a production to a graph G, a pushout complement has to be calcu- 
lated to obtain graph D. The existence of this pushout complement is guaranteed if the 
so-called dangling and identification conditions are satisfied. The first one establishes 
that a node in G cannot be deleted if this causes dangling edges. The second condition 
states that two different nodes or edges in L cannot be identified (by means of a non- 
injective match) as a single element in G if one of the elements is deleted and the other 
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is preserved. Moreover, the injectivity of I: K — > L guarantees the uniqueness of the 
pushout complement. The identification condition plus the dangling condition is at times 
known as gluing condition. 

In the example in Fig. 3.1 the match (1, 2, 3) >-> (a, 6, c) does not fulfill the dangling 
condition, as the deletion of node d would make edges (a, c) and (c, d) become dangling, 
so the production cannot be applied at this match. One example (for SPO, but it can be 
easily translated into DPO) in which the identification condition fails is depicted on the 
right of Fig. 3.7 on p. 50. 

3.1.2 Sequentialization and Parallelism 

l r 

A graph grammar can be defined as Q = ((p : L < — K — > R) P eP, Go) (see [11], Chap. 
3), where (p : L K — — > R) ps p is a family of productions indexed by their names and 
Go is the starting graph of the grammar. The semantics of the grammar are all reachable 
graphs that can be obtained by successively applying the rules in Q. Events changing a 
system state can thus be modeled using graph transformation rules. 

In real systems, parallel actions can take place. Two main approaches can be followed 
in order to describe and analyze parallel computations. In the first one, parallel actions 
are sequentialized, giving rise to different interleavings (for example a single CPU simu- 
lating multitasking). In the second approach, called explicit parallelism, actions are really 
simultaneous (more than one CPU performing several tasks). 




Fig. 3.3. Parallel Independence 



In the interleaving approach, two actions (rule applications) are considered to be 
parallel if they can be performed in any order yielding the same result. This can be 
understood in two different ways. 
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The first interpretation is called parallel independence and states that two alter- 
native direct derivations Hi <== G ==> H2 are independent if there are direct derivations 
such that Hi =^> X <= H2 (see Fig. 3.3). That is, both derivations are not in conflict, 
but one can be postponed after the other. It can be characterized using morphisms in 
a categorical style saying that two direct derivations (as those depicted in Fig. 3.3) are 
parallel independent if and only if 

3i : Li — » D2, j : £2 - > ~D\ | i| ° i = mi, l* a j = mi- (3-1) 

If one element is preserved by one derivation, but deleted by the other, then the latter 
is said to be weakly parallel independent of the first (it is characterized in equation 
3.4). Thus, parallel independence can be defined as mutual weak parallel independence. 

On the other hand, two direct derivations are called sequential independent if 
they can be performed in different order with no changes in the result. That is, both 
G =^> Hi =S^> X and G =^> H2 ==> X yield the same result (see Fig. 3.4). Again, 
categorically we say that two derivations are sequential independent if and only if 

3i : Ri — > D 2 , j : L 2 — > D\ \l* ° i = ml , r* o j ' = m 2 . (3.2) 

Mind the similarities with confluence (problem 5) and local confluence. 




Fig. 3.4. Sequential Independence 



The conditions for sequential and parallel independence are given in the Local Church 
Rosser theorem [11], Chaps. 3 and 4. It says that two alternative parallel derivations 
arc parallel independent if their matches only overlap in items that arc preserved. Two 
consecutive direct derivations are sequential independent if the match of the second does 
not depend on elements generated by the first, and the second derivation does not delete 
an item that has been accessed by the first. Moreover, if two direct alternative derivations 
are parallel independent, their concatenation is sequential independent and viceversa. 
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The explicit parallelism view [2; 11] abstracts from any application order (no in- 
termediate states are produced). In this approach, a derivation is modeled by a single 
production, called parallel production. Given two productions, pi and p 2 , the parallel 
production p\ + p 2 is the disjoint union of both. The application of such production is 
denoted as G p l^ s x. 

Two problems arise here: The sequentialization of a parallel production (analysis), 
and the parallclization of a derivation (synthesis). In DPO, the parallelism theorem states 

Pi ~\~Pl Pi 

that a parallel derivation G X can be sequentializcd into two derivations (G 

Pi Pi Pi 

Hi X and G H 2 => X) that are sequential independent. Conversely, two 
derivations can be put in parallel if they are sequentially independent. 

This is a limiting case of amalgamation, which specifies that if there are two pro- 
ductions pi and p 2 , then the amalgamated production pi ® Po p 2 is defined such that the 
production pi and p 2 can be applied in parallel and the amalgamated production p 0 (that 
represents common parts of both) should be applied only once. 

The concurrency theorem 1 deals with the concurrent execution of productions 
that needs not to be sequentially independent. Hence, according to previous results, it is 
not possible to apply them in parallel. Anyway, they can be applied concurrently using 
a so-called E-concurrent graph production, pi *eP2- We will omit the details, which can 
be consulted in [22]. 

Let the sequence G P ^J 1 h 1 P ^J 2 jj 2 be given. It is possible to construct a direct 
derivation G Pl J^ 2 jj 2 . The basic idea is to relate both productions through an over- 
lapping graph E, which is a subgraph of Hi, E = m*(Ri) u m 2 (L 2 ). The corresponding 
restrictions to* : Ri — > E and m 2 : L 2 — » E of to* and m 2 , respectively, must be jointly 
surjective. Also, any direct derivation G Pl = J^T 2 H 2 can be sequentializcd. 

3.1.3 Application Conditions 

We will make a brief overview of graph constraints and application conditions. In [14], 
graph constraints and application conditions were developed for the Double Pushout 

1 The concurrency theorem appeared in [22] for the first time, to the best of our knowledge. A 
somehow related concept - more general, though - was introduced simultaneously for Matrix 
Graph Grammar in [59]. We will review it in Sec. 6.4. 
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(DPO) approach to graph transformation and generalized to adhesive HLR categories in 
[22]. Atomic constraints were defined to be either positive or negative. A positive atomic 
graph constraint PC (c) (where c is an arbitrary morphism c: P — > C) is satisfied by 
graph G if Vmp: P — » G injective morphism there exists some mc'. C — > P injective 
morphism such that mp = mc ° c, mathematically written G (= PC (c) (see left part of 
Fig. 3.5). It can be interpreted as graph C must exist in G if graph P is found in G. 

Graph morphism : L — > G satisfies the positive atomic application condition 
P (c, V™ c i) (with c: L — > P and P — > Ci) if assuming G (= PC (c), for all associated 
morphisms mp: P — > G, 3mcV Ci — > G such that G |= PC (c,-). The notation used 
is rn l |= P (c, \A Cj), having also a similar interpretation to that of graph constraints: 
Suppose L is found in G, if P is also in G then there must be some Ci in G. Refer 
to the diagram on the right side of Fig. 3.5. A positive graph constraint is a boolean 
formula over positive atomic graph constraints. Positive application conditions, negative 
application conditions and negative graph constraints arc defined similarly. 




Fig. 3.5. Generic Application Condition Diagram 



Finally, an application condition AC (p) = [A^^Ap) for a production p : L — > R 
consists of a left application condition Al over L (also known as precondition) and a right 
application condition or postcondition Ap over R. A graph transformation satisfies the 
application condition if the match satisfies Al and the comatch satisfies Ap. In [14; 32] it 
is shown that graph constraints can be transformed into postconditions which eventually 
can be translated into preconditions. In this way, it is possible to ensure that starting 
with a host graph that meets certain restrictions, the application of the production will 
output a graph that still satisfies the same restrictions. 

DPO approach has been embedded in the weak adhesive HLR categorical approach, 
which we will shortly review in the following subsection. 
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3.1.4 Adhesive HLR Categories 

This section finishes with a celebrated generalization of DPO.lt was during 2004 that 
adhesive HLR categories were defined by merging two striking ideas: Adhesive cat- 
egories [43] and high level replacement systems [16; 17]. See 2.2 for a quick overview of 
category theory. 

Basic definitions are extended almost immediately to adhesive HLR systems (C,A4). 
A production p : (L < — K — > R) consists of three objects L, K and R, the left hand 
side, the gluing object and the right hand side, respectively, and morphisms I : K —> L 
and r : K — » R with I, r e A4. There is a slight change in notation and the term derivation 
is subsituted by transformation, and direct derivation by direct transformations. 
Adhesive HLR grammars and languages are defined in the usual way. 

In order to apply a production we have to construct the pushout complement and a 
necessary and sufficient condition for it is the gluing condition. For adhesive HLR systems 
this is possible if we can construct initial pushouts, which is an additional requirement 
(it does not follow from the axioms of adhesive HLR categories): A match m : L — » G 
satisfies the gluing condition with respect to a production p : (L < — K — > R) if for the 
initial pushout over m in Fig. 3.6 there is a morphism / : X — > K such that r o / = /. 
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Fig. 3.6. Gluing Condition 



Parallel and sequential independence are defined analogously to what have been pre- 
sented in Sec. 3.1 and the local Church-Rosser and the parallelism theorems remain 
valid. 
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3.2 Other Categorical Approaches 

This section presents other categorical approaches such as single pushout (SPO) and 
pullback and compares them with DPO (Sec. 3.1). 

In the single pushout approach (SPO) to graph transformation, rules are modeled 
with two component graphs (L and R) and direct derivations are built with one pushout 
(which performs the gluing and the deletion). SPO relies on category Graph p of graphs 
and partial graph morphisms. 

A SPO production p can be defined as p : (L R), where r is an injective partial 
graph morphism. Those elements for which there is no image defined are deleted, those 
for which there is image are preserved and those that do not have a preimage are added. 

A match for a production p in a graph G is a total morphism m: L — » G. Given a 
production p and a match m for p in G, the direct derivation from G is the pushout of 
p and m in Graph p . As in DPO, a derivation is just a sequence of direct derivations. 

Left part of Fig. 3.7 shows an example of the rule in Fig. 3.1, but expressed in the 
SPO approach. The production is applied to the same graph G as in Fig. 3.2 but at a 
different match. 

An important difference with respect to DPO is that in SPO there is no dangling 
condition: Any dangling edge is deleted (so rules may have side effects). In this example, 
node c and edges (a, c) and (c, d) are deleted. In addition, in case of a conflict with 
the identification condition due to a non-injective matching, the conflicting elements are 
deleted. 

Due to the way in which SPO has been defined, even though the matching from 
the LHS into the host graph is a total morphism, the RHS matching can be a partial 
morphism (see the example on the right of Fig. 3.7). 

In order to guarantee that all matchings are total it is necessary to ask for the 
conflict-free condition: A total morphism m : L — > G is conflict free for a production 
p : L — > i? if and only if 

m{x) = m(y) => [x, y e dom(p) or x, y $ dom(p)] . (3.3) 

Results for explicit parallelism are slightly different in SPO. In this approach, a paral- 

Pl ~\-P2 Pi P2 P2 

lei direct derivation G X can be sequentialized into G => H± => X if G => Hi is 
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Fig. 3.7. SPO Direct Derivation 

weakly parallel independent of G ==5> -Hi (and similarly for the other sequentialization). 
So as this condition may not hold, there are parallel direct derivations that do not have 
an equivalent interleaving sequence. 
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Fig. 3.8. SPO Weak Parallel Independence 
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These conditions will be written explicitcly because we will make a comparison in 
Sec. 6.1. Derivation d\ is weakly parallel independent of derivation (see Fig. 3.8) if 
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Fig. 3.9. SPO Weak Sequential Independence 
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There is an analogous concept, similarly defined, known as weak sequential in- 
dependence. Let g?i and d 2 be as denned in Fig. 3.9, then d 2 is weakly sequentially 
independent of d\ if 

m 2 (L 2 )nmt(Ri\ P i(L 1 )) = 0. (3.5) 

If additionally 

m* (Ri) nm 2 (L 2 \dom(p 2 )) = 0 (3.6) 

then d 2 is sequentially independent of d\. 

It is possible to synthesize both concepts (weak sequential independence and parallel 
independence) in a single diagram. See Fig. 3.10. 




Due to the fact that approaches based on the pushout construction can not replicate 
substructures naturally, Baudcron and others have proposed a different setting by using 
pullbacks instead of pushouts [3; 4; 5]. We will call them SPB and DPB approaches, 
depending on the number of pullbacks, similarly to SPO and DPO. 

Note that pullbacks are subobjects of products (see Sec. 2.2) and that products are (in 
some sense) a natural replication mechanism. It has been shown that pullback approaches 
are strictly more expressive than those using pushouts, but they have some drawbacks 
as well, such as: 
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1. The existence condition for pullback complements is much more complicated than 
with pushouts (gluing condition). 

2. In general, this condition can not be treated with computers [36]. 

3. There is a loss in comprehensibility and intuitiveness. 

In Fig. 3.11 what we understand by a replication that can be handled easily with 
SPB but not with SPO is illustrated. The pullback construction is depicted in dashed 
red color on the same production, which is drawn twice. On the left, the production on 
top with the morphism back to front (its LHS on the right and viceversa) and the system 
evolves from left to right (as in SPO or DPO), i.e. the initial state is Hi and the final 
state is H 2 . 

On the right of the same figure the production is represented more naturally for us 
(the left hand side on the left and the right hand side on the right) but on the bottom 
of the figure. The system evolves on top from right to left (it should be more intuitive if 
it evolved from left to right). Besides, we notice that what we understand as the initial 
state is now given by the RHS of the production while the final state is given by the left 
hand side. 2 
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Fig. 3.11. SPB Replication Example 



2 Anyway, this is not misleading with some practice. 
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3.3 Node Replacement 

Node Replacement grammars [23] (Chap. 1) are a class of graph grammars based on the 
replacement of nodes in a graph. The scheme is similar to the one described in Sec. 1.1, 
on p. 3 but with some pecularities and notational changes. There is a mother graph 
(LHS, normally it consists of a single node) and a daughter graph (RHS) together 
with a gluing construction that defines how the daughter graph fits in the host graph 
once the substitution is carried out. Nodes of the mother graph play a similar role to 
non-terminals in Chomsky grammars. The differences among different node replacement 
grammars reside in the way the gluing is performed. 

We will start with NLC grammars (Node Label Controlled, [23], Chap. 1) which are 
defined as the 5-tuple 

G=(E,A,P,C,S) (3.7) 

where S are all node labels (alphabet set), A are node labels (A c E) that do not appear 
on the LHS of any production (alphabet set of terminals, so non-terminals are £ — A), 
P is the set of productions, C are the gluing conditions (connection contructions) and S 
is the initial graph. 

Here only node labels matter. Each production is defined as a non-terminal node 
producing a graph with terminals and non-terminals along with a set of connection 
instructions. For example, in Fig. 3.12 we see a production p with X in its LHS and a 
subgraph in its RHS along with a connection relation c in the box. 

Production application (its semantics, also in Fig. 3.12) consists of deleting the LHS 
from the host graph, add the RHS and finally connect the daughter graph with the start 
graph. There are no application conditions. 

The linking part is performed according to a connection relation, which is a pair of 
node labels of the form (x,y): If the left hand side node was adjacent to a node labeled 
x then all nodes in the RHS with label y will be adjacent to it. 

NLC is a class of context-free graph grammars, in particular recursively defined prop- 
erties can be described. Also, they are completely local and have no application conditions 
which allows to model derivations by derivation trees. However, the yield of a derivation 
tree is dependant on the order in which productions are applied. This property is known 
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Fig. 3.12. Example of NLC Production 



as confluence (see problem 5) and the subclass of NLC grammars that are confluent is 
called C-NLC. 

At times it is desirable to refer to a concrete node instead of to a whole family 
in the gluing instructions. This variation is known as NCE grammar (Neighbourhood 
Controlled Embedding) and is formally defined to be the tuple 

G = (27, A, P, S) (3.8) 

where 27, A and S arc defined as above but productions in set P arc different. 

The grammar rule p : X — » (D, C) contains the production p : X — » D and the 
connection C. The connection is of the form (u, x) where u is a label and a; is a particular 
node in the daughter graph. Note that NCE graph grammars are still NLC-like grammars, 
at least concerning replacement. 

NCE can be extended in several ways but the most popular one is adding labels and 
a direction to edges, giving rise to edNCE grammars. Productions in edNCE are equal 
to those in NCE but connections differ a little bit, being of the form 

(fi,p/q,x,d), (3.9) 

where fi is a node label, p and q are edge labels, x is a node of D and d e {in, out] (which 
specifies the direction of the edge). For example, if d = in the connection in (3.9), it 
specifies that the embedding process should establish an edge with label q to node x of 
D from each //-labeled p-neighbour ofmeM (the mother graph) that is an in-neighbour 

of TO. 
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The expressive power of edNCE is not increased neither if grammar rules change 
directions of edges nor if connection instructions make use of multiple edges. 

The graphical representation differs a little from that of DPO and SPO. The daughter 
graph D is included in a box and the area surrounding it represents its environment. Non- 
terminal symbols are represented by capital letters inside a small box (the large box itself 
can be viewed as a non-terminal symbol) . Connection instructions are directed lines that 
connect nodes inside (new labels) with nodes outside (old labels). 




Fig. 3.13. edNCE Node Replacement Example 



Example. nThe notation G = Hi\n/H{\ is employed for a derivation, meaning that 
graph G is obtained by making the subsitution n >— > H\ in Hi, i.e. by replacing node n 
in Hi with graph H\. In the example of Fig. 3.13 (with non-terminal node N) we have 
substituted the non-terminal node in Hi by Hi attaching nodes according to labels in 
arrows (a) to get GM 

Associativity - reviewed in the next section - is a natural property to be demanded on 
any context-free rewriting framework and is enjoyed by edNCE grammars. Some edNCE 
grammars are context-dependant because they do not need to be confluent, i.e. the result 
of a derivation may depend on the order of application of its productions. The class of 
confluent edNCE grammars is represented by C-edNCE. 

C-edNCE grammars fulfill some nice properties such as being closed under node or 
edge relabeling. It is possible to define the notion of derivation tree as in the case of 
context-free string grammars (see [23], Chap. 1). 

Many subclasses of edNCE grammars have been - and are being - studied. Just to 
mention some, apart from C-edNCE, B-edNCE (Boundary, in which non-terminal nodes 
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are not connected), 3 B rl( 2-edNCE (non-terminal neighbour deterministic B-edNCE 
grammar), A-edNCE (in every connection instruction (c, (3/ J, x, d) a and x are termi- 
nal) and LIN-edNCE (linear, if every production has at most one non-terminal node). 

3.4 Hyperedge Replacement 

The basic idea is similar to node replacement but acting on edges instead of nodes, i.e. 
edges are substituted by graphs, playing the role of non-terminals in Chomsky grammars 
[23]. 

Hyperedge replacement systems are adhesive HLR categories that can be rewritten 
as DPO graph transformation systems. 

We will illustrate the ideas with an edge replacement example (instead of hyperedge 
replacement, to be defined below) in a very simple case. Suppose we have a graph as 
the one depicted on the left of Fig. 3.14, with a labeled edge e to be substituted by the 
graph depicted on the center of Fig. 3.14, in which the special nodes (1 and 2) are used 
as anchor points. The result is displayed on the right of Fig. 3.14. 

G, G, H=[elG x ] 




Fig. 3.14. Edge Replacement 



A production in essence is what we have done, with a LHS made up of labels and a 
graph as RHS. The notation H = \e/G{\, also G => [e/Gi], is standard to mean that 
graph (hypergraph) H is obtained by deleting edge e and plugging in graph G\. 

3 The daughter does not have edges between non-terminal nodes and in no connection instruc- 
tion (<r, /3/7, x, d) a is non-terminal or, in other words, every non-terminal has a boundary of 
terminal neighbours. 

4 The idea behind this extension is that every neighbour of a non-terminal is uniquely deter- 
mined by their labels and the direction of the edge joining them. Therefore, when rewriting 
the non-terminal, it is possible to distinguish among neighbours. 
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A hyperedge is defined in [23] (Chap. 2) as an atomic item with a label and an 
ordered set of tentacles. Informally, a hypergraph is a set of nodes with a collection 
of hyperedges such that each tentacle is attached to one node (a formal definition is 
provided below). Note that directed graphs are a special case of hypergraphs. Normally 
it is established that the label of a hyperedge is the number of its tentacles. 

For a given string w, the length of the string is denoted by \w\. For a set A, A* is 
the set of all strings over A. The free symbolwise extension /* : A* — > B* of a mapping 
/ : A — > B is defined by 

f*(a 1 ---a k ) = f( ai )---f(a k ) (3.10) 

VfceN and a» e A, i e {1, . . . , k}. 

Let C be a set of labels and let t : C — > N be a typing function. A hypergraph H 
over C is the tuple 

(V,E,aU,lab,ext) (3.11) 

where V is the set of nodes, E the set of hyperedges, att : E — > V* a mapping that assigns 
a sequence of pairwise distinct attachment nodes att(e) to each e e E, lab : E — > C a 
mapping that labels each hyperedge such that t(lab(e)) = \att(e)\ and ext e V* are 
pairwise distinct external nodes. The type of a hyperedge is its number of tentacles and 
the type of a hypergraph is its number of external nodes. The set of hypergraphs will be 
denoted H, or He if we need to explicitely refer to the set of types. 

Two hypergraphs H and H' are isomorphic if there exist i = (iy, Ie), W ■ Hy — > H v , 
ie ■ He — > H' E such that: 

1. i v (att H (e)) =att H ,(i E (e)). 

2. Ve e Eh, labn(e) = lab H ,(i E (e)). 

3. i v (ext H ) = ext H ,. 

As it usually happens in algebra, equality is defined up to isomorphism. 

If R = {ei, . . . ,e n } c Eh is the set of hyperedges to be replaced and there is a 
preserving type function r : R — > H (Ve 6 R, t(r(e)) = t(e)) such that r(e») = then 
we write it both as H[e\/R\, . . . , e n /R n ] or as H[r]. 

Hyperedge replacement belongs to the gluing approaches and follows the high level 
scheme introduced in Sec. 1.1: The replacement of R in H according to r is performed by 
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first removing R from Eh, then Ve e R the nodes and hyperedges of r(e) are disjointly 
added and the i-th external node of r(e) is fused with the i-th attachment node of e. 

If a hyperedge is replaced its context is not affected. Therefore, hyperedge replacement 
provides a context-free type of rewriting as long as no additional application conditions 
are employed. 

There are three nice properties fulfilled by hyperedge replacement grammars that we 
will briefly comment and that can be compared with the problems introduced in Sec. 1.3, 
in particular problems 2, 3 and 5, 6. Let's assume the hypothesis on hyperedges necessary 
so the following formulas make sense: 

• Sequentialization and Parallelism: Assuming pairwise distinct hyperedges, 

H[ei/H lt . . . , e n /H n ] = Hfa/Hi] ■ ■ ■ H[e n /H n ] 

• Confluence: Let e\ and e 2 be distinct hyperedges, 

tf[ei/tfi][e 2 /#2] = ff[e 2 /tf 2 ][ ei /#i]. (3.13) 

• Associativity: 

H^/H^/H?] = H [ e2 /tf 2 [ei/tfi]] . (3.14) 

Note however that in hyperedge replacement grammars, confluence is a consequence 
of the first property which holds due to disjointness of application of grammar rules. 

A production p over the set of non-terminals TV c. C is an ordered pair p = (A, R) with 
A e N, R e H and t(A) = t(R). A direct derivation is the application of a production, 
i.e. the replacement of a hyperedge by a hypergraph. If H e Tt, e e Eh and (labn(e), R) 
is a production then H' = H[e/R] is a direct derivation and is represented by H => H'. 
As always, a derivation is a sequence of direct derivations. 

Formally, a hyperedge replacement grammar is a system HRG = (N, T, P, S) where 
N is the set of non-terminals, T is the set of terminals, P is the set of productions and 
S e N is the start symbol. 

We will finish this section with a simple example that generates the string-graph 
language 5 L (A n B n ) = {(a n b n ) \n ^ 1}- This is the graph-theoretic counterpart of the 
Chomsky language that consists of strings of the form (a n b n ), n ^ 1, i.e. that has any 

5 This example is adapted (simplified) from one that appears in [23], Chap. 2. 
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string with an arbitrary finite number of a's followed by the same number of b's, e., 
aabb, aaabbb, etcetera. 



Pi \ 1 



L 2 \ R 2 



2 „ b _2 



Fig. 3.15. String Grammar Example 



A black filled circle • represents an external node while non-filled circles o are internal 
nodes. A box represents a hyperedge with attachments with the label inscribed in the 
box. A 2-edge is represented by an arrow joining the first node with the second. 

The grammar is defined as A n B n = ({S}, {a, b}, P, S), where the set of productions 
P = {pi,P2} is depicted in Fig. 3.15. Production pi is necessary to get the graph-string 
ab and to stop rule application. The start graph and an evolution of the grammar - 
derivation p\\p2\P2 ~ can be found in Fig. 3.16. 




Fig. 3.16. String Grammar Derivation 
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3.5 MSOL Approach 

It is possible to represent graphs as logical structures, expressing their properties by 
logical formulas or, in other words, use logical formulas to characterize classes of graphs 
and to establish their properties out of their logical description. In this section we will 
give a brief introduction to monadic second order logics for graph transformation. Refer 
to Chap. 5 of [23] and references therein cited. 

Currently it is not possible to define graph transformation in terms of automata (recall 
that in language theory it is essential to have transformations that produce outputs while 
traversing words or trees). Quoting B. Courcelle (Chap. 5 of [23]): 

The deep reason why MSOL logic is so crucial is that it replaces for graphs (. . .) 
the notion of a finite automaton (...) 

The key point here is that these transformations can be defined in terms of MSOL 
formulas (called definable transductions). 

Graph operations will allow us to define context-free sets of graphs as components 
of least solutions of systems of equations (without using any graph rewriting rule) and 
recognizable sets of graphs (without using any notion of graph automaton). 

Graphs and graph properties are represented using logical structures and relations. 
A binary relation R c A x B is a multivalued 6 partial mapping that we will call 
transduction. Recall from Sec. 2.1 that an interpretation in logics in essence defines 
semantically a structure in terms of another one, for which MSOL formulas will be used. 

Let TZ be a finite set of relation symbols and let p(R) be the arity of R e TZ. An 
^-structure is the tuple S = (D, (R)r € tz) such that D is the (possibly infinite) domain 
of S and each R is a p(i?)-ary relation on D, this is, a subset of D p ( R \ The class of 
^-structures is denoted by STRiJZ). 

As an example of structure, for a simple digraph G made up of nodes in V we have 
the associated TvL-structure \G\ 1 = (V,edg), where (x,y) e edg if and only if there is an 
edge starting in x and ending in y. Note that this structure represents simple digraphs. 

The set of monadic second order formulas over TZ with free variables in y is represented 
by MS (TZ, y). As commented in Sec. 2.1, languages defined by MSOL formulas are regular 
languages. 

6 One element may have several images. 
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Let Q and 1Z be two finite ranked sets of relation symbols and W a finite set of set 
variables (the set of parameters). A (1Z, Q)-definition scheme is a tuple of formulas of 
the form: 



The aim of these formulas is to define a structure T in STR(Q) out of a structure S 
in STR(TZ). The notation needs some comments: 

• <fi e MS(1Z, W) defines the domain of the corresponding transduction, i.e. T is defined 
if <fi is true for some assignment in S of values assigned to the parameters. 

• ipi e MS(7£, W u {xi}) defines the domain of T as the disjoint union of elements in 
the domain of S that satisfy if>i for the considered assignment. 

• 6 W e MS(1Z,W u {xi, . . . , £p( g )}) for w = (q,j) e Q*k, where we define Q*k = 
{w | q e Q,j e [/c] p ( 9 )} and [k] = {1, . . . k}, k 6 N. Formulas #„, define the relation q T . 

For a more rigurous definition with some examples, please refer to [23], Chap. 5. The 
important fact of transductions is that they keep monadic second order properties, i.e. 
monadic second order properties of S can be expressed as monadic second order prop- 
erties in T. Furthermore, the inverse image of a MS-definable class of structures under 
a definable transduction is definable (not so for the image), as well as the composition 
and the intersection of a definable structure with the cartesian product of two definable 
structures. However, there are some "negative" results apart from the image, e.g. the 
inverse of a definable transduction is not dcfineable neither is the intersection of two 
definable transductions. 

The theory goes far beyond, for example by defining context free sets of graphs by 
systems of recursive equations, generalizing in some sense the concatenation of words in 
string grammars. No attention will be paid to rigurous details and definitions (again, see 
Chap. 5 in [23]) but a simple classical example of context free grammars will be reviewed: 
Let A = {ai, . . . , a n } be a finite alphabet, e the empty word and A* the set of words over 
A. Let's consider the context-free grammar G = {u — > auuv, u — > avb, v — > avb, v — > ab}. 
The corresponding system of recursive equations would be: 




(3.15) 



S = (u = a.(u.(u.v)) + a.(v.b), v = a.(v.b) + a.b} 
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where "." is the concatenation. It is possible, although we will not see it, to express node 
replacement and hyperedge replacement in terms of systems of recursive equations. 

Analogously to the way in which the equational set extends context-freeness, recog- 
nizable sets extend regular languages. For example, it is possible to show that every set of 
finite graphs or hypergraphs defined by a formula of an appropriate monadic second order 
language is recognizable with respect to an appropriate set of operations (the converse 
also holds in many cases). 

3.6 Relation-Algebraic Approach 

We will mainly follow [51] and [36] in this section, paying special attention to the justi- 
fication that the category Graph p has pushouts, which will be used in Chap. 5 for one 
of the definitions of direct derivation. 

We will deviate from standard relational methods 7 notation in favor of other which 
is probably more immediate for mathematicians not acquainted with it and, besides, we 
think cases comparison with the rest of the approaches in this chapter. 

A relation r\ from S\ to S 2 is a subset of the cartesian product S\ x S 2 , denoted 
by n : Si — - S 2 . Its inverse r _1 : S 2 — - Si is such that (s 2 ,si) e r^ 1 (si,s 2 ) e ri. If 
Ti '■ Si —r S3 is a relation, the composition r 2 ri = r 2 o n : Si — - S3 is again a relation 
such that 



As relations are sets, naive set operations are available such as inclusion (c), inter- 
section (n), union (u) and difference (— ). It is possible to form the category Rel of sets 
and relations (the identity relation I5 = S S is the diagonal set of S x S), which 
besides fulfills the following properties: 



(si,s 3 ) £r 2 ono [3s 2 e S 2 \ (si,s 2 ) e n, (s 2 , s 3 ) e S 2 ] . 



(3.16) 




• (r 2 o n ) 1 = r l 1 o r 2 1 . 

• Distributive law: r 2 o ({J aeJ i(ra)) 0 ri 



U aS A (r2°r a o ri ). 



7 Visit the RelMiCS initiative at http://www2.cs.unibw.de/Proj/relmics/html/. 
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A relation / : Si —> Si such that / o f~ l c lg 2 is called a partial function and it is 
represented with an arrow instead of a harpoon, / : S± — > If Is-l ^ / _1 o / also, then 
it is called a total function. Note that these are the standard set-theoretic definitions 
of partial function and total function. The function / is injective if / _1 o / = l Sl and 
surjective if / o f" 1 = l Sa . 

The category of sets and partial functions is represented by Set p . It can be proved 
that Set p has small limits and colimits, so in particular it has pushouts. 

For a relation r : S — - T its domain is also a relation d : S S and is given by the 
formula d(r) = (r _1 or) n I5. 

In order to define graph rewriting using relations we need a relational representation 
of graphs. A graph (S, r) is a set S plus a relation r : S S. A partial morphism 
between graph (Si, n) and (52, r 2 >, p : Si — > S2, is a partial function p such that: 

p o n o cr 2 op. (3-17) 

It is not difficult to see that the composition of two partial morphisms of graphs 
is again a partial morphism of graphs. It is a bit more difficult (although still easy to 
understand) to show that the category Graph p of simple graphs and partial morphisms 
has pushouts (theorem 3.2 in [51]). The square depicted in Fig. 3.17 is a pushout in Set p 
if the formula for the relation h is given by: 

h = (to* o r o to*" 1 ) u (p* ogo p*" 1 ) . (3.18) 

A production is defined similarly to the SPO triple of two graphs (L, I), 

(R,r} and a partial morphism p : L — > R. A match for p is a morphism of graphs 
M : (L,V) — > (G,g}. A production plus a match is a direct derivation. As always, a 
derivation is a finite sequence of direct derivations. 

Equation (3.18) defines a pushout in category Set p which is different than a rewriting 
square (a direct derivation). If we want the rewriting rule to be a pushout, the relation 
in (H, K) must be denned by the equation: 

h = (to* or o to* -1 ) u [p* o [g - to -1 o I o to) cp*" 1 ] . (3.19) 

The relation-algebraic approach is based almost completely in relational methods. To 
illustrate the main differences with respect to categorical approaches an example taken 
from [36] follows that deals with categorical products. 
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(L,l) 



p 



(R,r) 



m 



* 



Y 



* 



Y 



.> (H,h) 



Fig. 3.17. Pushout for Simple Graphs (Relational) and Direct Derivation 



Example. oln order to define the categorical product - see Sec. 2.2 - it is necessary 
to check the universal property of being a terminal object, which is a global condition 
(it should be checked against the rest of candidate elements, in principle all elements in 
the category). In contrast, in relation algebras, the direct product of two objects X and 
Y is a triple (P, Tlx-, Hy) satisfying the following properties: 

• n x o n x l = l x and n Y o Ily 1 = ly. 

• n Y o n x x = u. 

• (n- 1 o n x ) n (n- 1 o 7i y ) = i P . 

where U is the universal relation (to be defined below) . Note that this is a local condition, 
in the sense that it only involves functions without quantification (in category theory this 
sort of characterizations are more like for all objects in the class there exists a unique 
morphism such that...)M 

The relational approach is based on the notion of allegory which is a category C as 
defined in Sec. 2.2 - the underlying category - plus two operations ( _1 and n) with the 
following properties: 8 

• 1 ) = t; (r o s) 1 = s _1 o r _1 ; (n n r%) 1 = r^ 1 n r^ 1 ■ 

• ri o (r 2 n r 3 ) c (n o r 2 ) n (ri o r 3 ). 

• Modal rule: (ri n 7*2) o r3 c n o [r3 n (r 2 o r^ 1 )]. 

The universal relation U for two objects X and Y in an allegory is the maximal 
element in the set of morphisms from X to Y, if it exists. If there is a least element, then 
it is called an empty relation or a zero relation. 

8 Compare with those on p. 62. 
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It is possible to obtain the other modal rule starting with the axioms of allegories: 
(n o r 2 ) c r 3 c [r 3 n (r 2 o r^ 1 )] o r 2 , (3.20) 
which can be synthesized in the so-called Dedekind formula: 

(n o r 2 ) o r 3 c [r 3 n (r 2 o r" 1 )] o [r 3 n (r 2 o rf 1 )] . (3-21) 

A locally complete distributive allegory is called a Dedekind category. A distrib- 
utive allegory is an allegory with joins and zero element; locally completeness refer to 
distributivity of composition with respect to joins. 

By using Dedekind categories [36] provides a variation of the DPO approach in which 
graph variables and replication is possible. We will not introduce it here because it would 
take too long, due mainly to notation and formal definitions, and it is not used in our 
approach. 

As a final remark, [36] proceeds by defining pushouts, pullbacks, complements and 
an amalgamation of pushouts and pullbacks (called pullouts) over Dedekind categories 
to define pullout rewriting. 

3.7 Summary and Conclusions 

The intention of this quick summary is to make an up-to-date review of the main ap- 
proaches to graph grammars and graph transformation systems: Categorical, relational, 
set-theoretical and logical. The theory developed so far for any of these approaches goes 
far beyond what has been exposed here. The reader is referenced to cites spread across 
the chapter for further study. 

Throughout the rest of the book we will see that their influence in Matrix Graph 
Grammars varies considerably depending on the topic. For example, our basic diagram 
for graph rewriting is similar to that of SPO 9 but the way to deal with restrictions on 
rules (application conditions) is much more "logical" , so to speak. 

9 Chapter 5 defines what a derivation is in Matrix Graph Grammars. Two different but equiva- 
lent definitions of derivations are provided, one using a pushout construction plus an operator 
defined on productions and another with no need of categorical constructions. 
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We are now in the position to introduce the basics of our proposal for graph grammars. 
This will be carried out in the next chapter, Chap. 4, with the pecularity that (to some 
extent) there is no need for a match of the rule's left hand side, i.e. we have productions 
and not direct derivations. 

Matching will be studied in detail in Chap. 5, so all concepts and theorems of Chap. 
4 for productions can be extended to direct derivations and derivations. 



4 
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In this chapter, ideas outlined in Chap. 1 will be soundly based, assuming a background 
knowledge on the material of Sees. 2.1, 2.3 and 2.6. No matching to any host graph is 
assumed, although identification of elements (in essence, nodes) of the same type will be 
specified through completion. 

Analysis techniques developed in this chapter include coherence of sequences, min- 
imal and negative initial digraphs and compatibility of productions inside a sequence. 
These concepts will be used to tackle applicability (problem 1), sequential independence 
(problem 3) and reachability (problem 4). 

In Sec. 4.1 the dynamic nature of a single grammar rule is developed together with 
some basic facts. The operation of completion is studied in Sec. 4.2, which basically 
permits algebraic operations to be performed as one would like. Section 4.3 deals with 
sequences, i.e. ordered sets of grammar rules applied one after the other. 1 To this end we 
will introduce the concept of coherence. Due to its importance, sequences will be studied 
in deep detail in Chap. 6. The problem of finding those elements that must be present 
(minimal initial digraph) or must not appear (negative initial digraph) is addressed in 
Sec. 4.4. At times it is of interest to build a rule that performs the same actions than 
a given coherent sequence but is applied in a single step, i.e. no intermediate states 
are generated. This is composition, as normally defined in mathematics. As they are 
related, the definition of compatibility for a sequence of productions is also defined and 

1 At times we will use the term concatenation as a synonym. A derivation is a concatenation 
of direct derivations, and not just of productions. 
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characterized in Sec. 4.5. Finally, as in every chapter, there is a section with a summary 
and some conclusions. 

4.1 Characterization and Basic Concepts 

A production (also known as grammar rule) is defined as an application which transforms 
a simple digraph into another simple digraph, p : L — > R. We can describe a production 
p with two matrices (those with an E superindex) and two vectors (those with an N 
superindex), p = (L E , R E ,L N , R N ), where the components are respectively the left hand 
side edges matrix (L E ) and nodes vector (L N ), and the right hand side edges matrix 
(R E ) and nodes vector (R N ) ■ 

L E and R E are the adjacency matrices and L N and R N are the nodes vector as 
studied in Sec. 2.3. A formal definition is given for further reference: 

Definition 4.1.1 (Production - Static Formulation) A grammar rule or produc- 
tion p is a partial morphism 2 between two simple digraphs L and R, and can be specified 
by the tuple 



where E stands for edge and N for node. L is the left hand side and R is the right hand 
side. 

It might seem redundant to specify nodes as they are already in the adjacency matrix. 
The reason is that they can be added or deleted during rewriting. Nodes and edges 
are considered separetely, although it could be possible to synthesize them in a single 
structure using tensor algebra. See the construction of the incidence tensor - Def. 1 - in 



It is more interesting to characterize the dynamic behaviour of rules for which matrices 
will be used, describing the basic actions that can be performed by a production: Deletion 
and addition of nodes and edges. Our immediate target is to get a dynamic formulation. 

In this book p will be injective unless otherwise stated. A production models deletion 
and addition actions on both edges and nodes, carried out in the order just mentioned, 
2 Partial morphisms since some elements in L may not have an image in R. 




(4.1) 



Sec. 8.3. 
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i.e. first deletion and then addition. Appropriate matrices are introduced to represent 
them. 

Definition 4.1.2 (Deletion and Addition of Edges) Matrices for deletion and ad- 
dition of edges are defined elementwise by the formulas 

J? ( 1 if edqe (i, j) is to be erased 

I 0 otherwise 

( 1 if edqe (i, j) is to be added 
r E = (r).. = I (4.3) 
10 otherwise 

For a given production p as above, both matrices can be calculated through identities: 

e E = L E a (L E a R E ) = L E a (I? v W^j = L E a W (4.4) 
r E =R E a (L E a R E ) = R E a (W v Z^) = R E a (4.5) 

where L E a R e are the elements that are preserved by the rule application (similar to 
the K component in DPO rules, see Sec. 3.1). Thus, using previous construction, the 
following two conditions hold and will be frequently used: Edges can be added if they 
do not currently exist and may be deleted only if they are present in the left hand side 
(LHS) of the production. 

r E a = R E a a LZ = r E (4.6) 
e E a L E = L E a W a L E = e E . (4.7) 

In a similar way, vectors for the deletion and addition of nodes can be defined: 

Definition 4.1.3 (Deletion and Addition of Nodes) 

M , s ( 1 if node i is to be erased 
e = (e)j = I (4.8) 
I 0 otherwise 

fj , . ( 1 if node i is to be added 
r N = {r) l = \ 3 (4.9) 

0 otherwise 
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Fig. 4.1. Example of Production 



Example. □ An example of production is graphically depicted in Fig. 4.1. Its associated 
matrices are: 



ef 



0 1 1 


2 


Lf = 


1 


2 


0 0 0 


4 


1 


4 


1 0 1 


5 




1 


5 



Rf 



0 1 0 


2 




0 


2 


0 0 0 


4 


1 


4 


1 0 0 


5 




0 


5 



0 1 1 


2 


0 10 


3 


0 1 1 


5 


0 1 0 


2 


0 1 0 


3 


0 1 0 


5 



N 



1 


2 


1 


3 


1 


5 


0 


2 


1 


3 


0 


5 



The last colummn of the matrices specify node ordering, which is assumed to be 
equal by rows and by columns. In Sec. 4.4 - Subsec. 4.4.1 - the characterization of 
productions through matrices will be completed by introducing the nihilation matrix 
and the negative initial digraph. They keep track of all elements that can not be present 
in the graph (dangling edges and those to be added by the production). For an example 
of production with all its matrices, please see the one on page 77. 1 

Now we state some basic properties that relate the adjacency matrices and e and r. 



Proposition 4.1.4 (Rewriting Identities) Let p : L 

lowing identities are fulfilled: 



R be a production, the fol- 



r E A e E = r E 



e E Ar E 



R E a e E = R 1 



L E a r E 



r N a e N = r N 



e N a r N = e N 



R N Ae N =R 



N 



L N A r N 



- N 



(4.10) 

(4.11) 
(4.12) 
(4.13) 
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Proof 

□It is straightforward to prove these results using basic boolean identities. Only the first 
one is included: 

r E A ~^ = (In a i?) a (l e a W^j = 

= (In a R a Z^) v (Zz a R e a R E ^j = 

= (lE a v (i^ a = r E v r E = r E . (4.14) 

The rest of the identities follow easily by direct substitution of definitions. ■ 

First two equations say that edges or nodes cannot be rewritten - erased and created 
or viceversa - by a rule application (a consequence of the way in which matrices e and 
r are calculated). This is because, as we will see in formulas (4.16) and (4.17), elements 
to be deleted are those specified by e and those to be added are those in r, so common 
elements are: 

e a r = e aT a r a e = 0. (4-15) 

This contrasts with the DPO approach, in which edges and nodes can be rewritten in 
a single rule. 3 The remaining two conditions state that if a node or edge is in the right 
hand side (RHS), then it can not be deleted, and that if a node or edge is in the LHS, 
then it can not be created. 

Finally we are ready to characterize a production p : L — » R using deletion and 
addition matrices, starting from its LHS: 

R" = r N v (eF a L N ^j (4.16) 

R £ =r £ vp A L £ ). (4.17) 

The resulting graph R is calculated by first deleting the elements in the initial graph 
- e a L - and then adding the new elements - r v (e a L) -. It can be demonstrated using 
proposition 4.1.4 that, in fact, it doesn't matter whether deletion is carried out first and 
addition afterwards or viceversa. 4 

3 It might be useful for example to forbid a rule application if the dangling condition is violated. 
This is addressed in Matrix Graph Grammars through e-productions, see Chap. 5. 

4 The order in which actions are performed does matter if instead of a single production we 
consider a sequence. See comments after the proof of corollary 4.4.3. 
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Remark, nln the rest of the book we will omit a if possible, and avoid unnecessary 
parenthesis bearing in mind that a has precedence over v. So, e.g. formula (4.17) will 
be written 

R E =r E V ^E L E (418) 

Besides, if there is no possible confusion due to context or a formula applies to both 
edges and nodes, superscripts can be omitted. For example, the same formula would read 
R = r v eLM 

There are two ways to characterize a production so far, cither using its initial and 
final states (see definition 4.1.1) or the operations it specifies: 

p= (e E ,r E ,e N ,r N ). (4.19) 

As a matter of fact, they are not completely equivalent. Using L and R gives more 
information because those elements which are present in both of them are mandatory 
if the production is to be applied to a host graph, but they do not appear in the e-r 
characterization. 5 An alternate and complete definition to (4.1) is 

p= (L E ,e E ,r E ,L N ,e N ,r N ). (4.20) 

A dynamic definition of grammar rule is postponed until Sec. 4.4.2, definition 4.4.4 
because there is a useful matrix (nihilation matrix) that have not been introduced yet. 

Some conditions have to be imposed on matrices and vectors of nodes and edges 
in order to keep compatibility when a rule is applied, that is, to avoid dangling edges 
once the rule is applied. It is not difficult to extend the definition of compatibility from 
adjacency matrices (see definition 2.3.2) to productions: 

Definition 4.1.5 (Compatibility) A production p : L — > R is compatible if R = p(L) 
is a simple digraph. 

From a conceptual point of view the idea is the same as that of the dangling condition 
in DPO. Also, what is demanded here is completeness of the underlying space Graph p 
with respect to the operations defined. 

5 This usage of elements whose presence is demanded but are not used is a sort of positive 
application condition. See Chap. 7. 
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Next we enumerate the implications for Matrix Graph Grammars of compatibility. 
Recall that * denotes transposition: 

1. An incoming edge cannot be added (r E ) to a node that is going to be deleted (e N ): 

\\r E Qe N \\ 1 =0. (4.21) 

Similarly, for outgoing edges (j" 6 )*, the condition is: 



(r E ) 0. 



0. 



(4.22) 



2. Another forbidden situation is deleting a node with some incoming edge, if that edge 
is not deleted as well: 



e E L E Qe N 



0. 



Similarly for outgoing edges: 



(e E L E ) l O< 



N 



0. 



(4.23) 



(4.24) 



Note that e E L E are elements preserved (used but not deleted) by production p. 
3. It is not possible to add an incoming edge (r B ) to a node which is neither present in 
the LHS (l N ^J nor added (r N ) by the production: 



5 0 (r N L N ^j 
Similarly, for edges starting in a given node: 

(r £ ) 4 0pF) 



0. 



0. 



(4.25) 



(4.26) 



4. Finally, our last conditions state that it is not possible that an edge reaches a node 
which does not belong to the LHS and which is not going to be added: 



(e E L E ^j 0 (r N L N ^j 
And again, for outgoing edges: 



0. 



0. 



(4.27) 



(4.28) 
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Thus wc arrive naturally at the next proposition: 



Proposition 4.1.6 Let p : L — » R be a production, if conditions (4-21) - (4-28) are 
fulfilled then R = p(L) is compatible. 6 

Proof 

□ We have to check \\(M E v M|) ©Mjv^ = 0, with M E = r E v e E L E and ~M N = 
Applying (4.11) in the second equality we have 



(M E v M%) QM N 



(r E v e E L E ) v (r B v e^)'] 0 [r N (e N v L^)] 



r E v e E L E v (r £ )' v (e E £ B )1 0 (e w v r* L») . 



(4.29) 



Synthesizing conditions (4.21) - (4.28) or expanding (4.29) the proof is completed. ■ 

A full example is worked out in the next section, together with further explanations 
on node identification across productions and types. 



4.2 Completion 

Besides characterization, it is necessary to represent simple digraphs in some way. Note 
that a graph is just a collection of nodes and edges and our aim is to associate some 
algebraic structure to it. 

Grammars in essence rely on the possibility to apply several morphisms (productions) 
in sequence, generating languages. At grammar design time we do not know in general 
which actual initial state is to be studied but probably we do know which elements make 
up the system under consideration. 

For example, in a local area network we know that there are messages, clients, servers, 
routers, hubs, switches and cables. When a grammar is designed to study failure depen- 
dencies in the net, it can be the case that we do not know the actual net to be considered. 
Even more, our aim could be to design a grammar/language to study any possible local 
area net dependency, deadlock and failure recovery. 7 

6 p(L) is given by (4.16) and (4.17). 

7 An ambitious project, though. 
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It seems natural to introduce types, which are simply a level of abstraction in the set 
of elements under consideration. For example, in previous paragraph, messages, clients, 
servers, etcetera would be types. So there is a ground level in which real things are (one 
actual hub) and another a little bit more abstract level in which families of elements live. 



Example. □ Along this book we will use two ways of typing productions. The first 
manner will be to use natural numbers N > 0 and primes to distinguish between elements. 
On the left side of Fig. 4.2 there is a typical simple digraph with three nodes 1 (they are of 
type 1). This is correct as long as we do not need to operate with them. During "runtime" , 
i.e. if some algebraic operation is to be carried out, it is mandatory to distinguish between 
different elements, so primes are appended as depicted on the center of the same figure. 

For the second way of typing productions, check out a small network in Fig. 4.2 where 
there are two clients - (1:C) and (2:C) - one switch - (1:SW) - one router - (1:R) - and 
one server - (1:S) -. Types are C, SW, R and S and instead of primes we use natural 
numbers to distinguish among elements of the same type. 

Their adjacency matrices are: 




Fig. 4.2. Examples of Types 



10 0 0 
0 0 0 0 
0 0 11 
1110 



V 

1" 
1"' 

2 



0 0 10 1 

0 0 0 0 1 

0 0 110 

0 0 0 0 0 

10 0 11 



1 : C 

2 : C 
1 : R 
1 : S 

1 : SW 



Nodes of the same type can be identified across productions or when performing any 
kind of operation, while nodes of different types must remain unrelated. A production 
can not change the type of any node. In some sense, nodes in the left hand and right 
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hand sides of productions specify their types. Matching (refer to Chap. 5) transforms 
them in "actual" elements. 

Types of edges are given by the type of its initial and terminal nodes. In the example 
of Fig. 4.2, the type of edge e is (1, 2) and the type of edge e' is (2, 1). For edges, types 
(1,2) and (2,1) are different. See [10]. 

A type is just an element of a predefined set T and the assignment of types to 
nodes of a given graph G is just a (possibly non-injective) total function from the graph 
under consideration to the set of types, to ■ G — > T , such that it defines an equivalence 
relation ~ in G. 8 It is important to have disjoint types (something for granted if the 
relation is an equivalence relation) so one element does not have two types. In previous 
example, the first way of typing nodes would be T\ = N > 0 and the second T 2 = 
{(a : 0)\a e N > 0, (3 e {C, S, R, SW}}. 

The notion of type is associated to the underlying algebraic structure and normally 
will be specified using an extra column on matrices and vectors. Conditions and restric- 
tions on types and the way they relate to each other must be specified using restrictions 
(see Chap. 7). 

Next wc introduce the concept of completion. In previous sections we have assumed 
that when operating with matrices and vectors these had the same size, but in general 
matrices and vectors represent graphs with different sets of nodes or edges, although 
probably there will be common subsets. 

Completion modifies matrices (and vectors) to allow some specified operation. Two 
problems may occur: 

1. Matrices may not fully coincide with respect to the nodes under consideration. 

2. Even if they are the same, they may well not be ordered as needed. 

To address the first problem matrices and vectors are enlarged, adding the missing 
vertices to the edge matrix and settting their values to zero. To declare that these elements 
do not belong to the graph under consideration, the corresponding node vector is also 
enlarged setting to zero the newly added vertices. 

8 A reflexive (V3 e G,g ~ g), symmetric (V<?i,(?2 e G, [gi ~ g2 4» 32 ~ gi]) and transitive 
(Vffi, 32, 33 e G, [gi - g 2 , 32 ~ 53 gi ~ 33]) relation. 
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If for example an and is specified between two matrices, say A a B, the first thing 
to do is to reorder elements so it makes sense to and element by element, i.e. elements 
representing the same node are operated. If we are defining a grammar on a computer, 
the tool or environment will automatically do it but some procedure has to be followed. 
For the sake of an example, the following is proposed: 

1. Find the set C of common elements. 

2. Move elements of C upwards by rows in A and B, maintaining the order. A similar 
operation must be done moving corresponding elements to the left by columns. 

3. Sort common elements in B to obtain the same ordering as in A. 

4. Add remaining elements in A to B sorted as in A, immediately after the elements 
accessed in previous step. 

5. Add remaining elements in B to A sorted as in B. 

Addition of elements and reordering (the operations needed for completion) extend 
and modify productions syntactically but not from a semantical point of view. 



Example. nConsider the production depicted in Fig. 4.3. Its associated matrices are 
represented below. As already commented above, the notation for matrices will be ex- 
tended a little bit in order to specify node and edges types. It is assumed for the adjacency 
matrix that it is equally ordered by rows so we do not add any row. If it is clear from 
context or there is a problem with space, this labeling column will not appear, making 
it explicit in words if needed. 



L 



R 




Fig. 4.3. Example of Production (Rep.) 
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Lf 



e l 



0 1 1 


2 


Lf = 


1 


2 


0 0 0 


4 


1 


4 


1 0 1 


5 




1 


5 



Rf 



0 1 0 


2 


e^ - 
e i — 


0 


2 


0 0 0 


4 


1 


4 


1 0 0 


5 




0 


5 



0 1 1 


2 


0 1 0 


3 


0 1 1 


5 


0 1 0 


2 


0 1 0 


3 


0 1 0 


5 



R 



N 



1 


2 


1 


3 


1 


5 


0 


2 


1 


3 


0 


5 



For example, if the operation ef rf was to be performed, then both matrices must 
be completed. Following the steps described above we obtain: 





0 10 0 


2 




0 0 0 1 


2 




1 


2 




1 


2 


P E - 


0 0 0 0 
10 0 0 


4 
5 


rf = 


0 0 0 0 
0 0 0 1 


4 
5 


Lf = 


1 
1 


4 
5 


< = 


0 
1 


4 
5 




0 0 0 0 


3 




0 0 0 1 


3 




0 


3 




1 


3 



where, besides the erasing and addition matrices, the completion of the nodes vectors for 
both left and right hand sides are displayed. 

Now we check whether rf v ef Lf and rf v ef Lf are compatible, i.e. Rf and Rf 
define a simple digraph. Proposition 2.3.4 and equation (2.4) are used, so we need to 
compute (4.29) and, as 



r a \i p E T E 

r 1 v e 1 l, 1 



substituting we finally arrive at 



0 0 11 


2 


0 0 0 0 


4 


0 0 11 


5 


0 0 0 1 


3 



(4.29) 



0 0 11 
0 0 0 0 
0 0 11 
0 0 0 1 



rf(efvLf) 



0 0 0 0 
0 0 0 0 
10 10 
10 11 



o 



0 


2 


1 


4 


0 


5 


0 


3 




"o 




0 




0 




0 



as desired. ■ 

It is not possible, once the process of completion has finished, to have two nodes 
with the same number inside the same production 9 because from an operational point of 
view it is mandatory to know all relations between nodes. If completion is applied to a 
sequence then we will speak of a completed sequence. 



For example, if there are two nodes of type 8, after completion there should be one with a 8 
and the other with an 8'. 
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Note that up to this point only the production itself has been taken into account. 
Although this is half truth - as you will promptly see - we may say that we are starting 
the analysis of grammar rules without the need of any matching, i.e. we will analyze 
productions and not necessarily direct derivations, with the advantage of gathering in- 
formation at a grammar definition stage. Of course this is a desiderable property as long 
as results of this analysis can be used for derivations (during runtime) . 

In some sense completion and matching are complementary operations: Inside a se- 
quence of productions, matchings - as side effect - differentiate or relate nodes (and 
hence, edges) of productions. Completion imposes some restrictions to possible match- 
ings. If we have the image of the evolution of a system by the application of a derivation 
as depicted in Fig. 4.9 on p. 89, then matchings can be viewed as vertical identifications, 
while completions can be seen as horizontal identifications. 

The way completion has been introduced, there is a deterministic part limited to 
adding dummy elements and a non-deterministic one deciding on identifications. It should 
be possible to define it as an operator whose output would be all possible relations among 
elements (of the same type), i.e. completion of two matrices would not be two matrices 
anymore, but the set of matrices in which all possible combinations would be considered 
(or a subset if some of them can be discarded). This is related to the definition of initial 
digraph set in Sec. 5.3 and the structure therein studied. 

4.3 Sequences and Coherence 

Once we are able to characterize a single production, we can proceed with the study of fi- 
nite collections of them. 10 Two main operations, composition and concatenation, 11 which 
are in fact closely related, are introduced in this and next sections, along with notions 
that make it possible to speak of "potential definability" : Coherence and compatibility. 

Definition 4.3.1 (Concatenation) Let 0 be a grammar. Given a collection of produc- 
tions {pi, . . . ,p n } c 25, the notation s n = p n ',Pn-i', ■ ■ ■ ',Pi defines a sequence (concate- 
nation) of productions establishing an order in their application, starting with p\ and 
ending with p n . 

10 The term set instead of collection is avoided because repetition of productions is permitted. 

11 Also known as sequentialization. 
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In the literature of graph transformation, the concatenation operator is defined back 
to front, this is, in the sequence P2',Pi, production p 2 would be applied first and pi 
right afterwards [11]. The ordering already introduced is preferred because it follows the 
mathematical way in which composition is defined and represented. 12 

It is worth stressing that there exists a total order in a sequence, one production being 
applied after the previous has finished, and thus intermediate states are generated. These 
intermediate states are indeed the difference between concatenation and composition 
of productions (see Sec. 4.5). The study of concatenation is related to the interleaving 
approach to concurrency, while composition is related to the explicit parallelism approach 
(see Sec. 3.1). 

A production is moved forward, moved to the front or advanced if it is shifted one 
or more positions to the right inside a sequence of productions, either in a composition 
or a concatenation (it is to be applied earlier), e.g. P4;P3',P2',Pi >-» Ps\P2',Pi',P4- On 
the contrary, move backwards or delay means shifting the production to the left, which 
implies delaying its application, e.g. P4; p 3 ; p 2 ; Pi >-> Pi;P4',P3',P2- 

Definition 4.3.2 (Coherence) Given the set of productions {pi, . . . ,p n }, the completed 
sequence s n = p n ;p n -\; . . . ;pi is called coherent if actions of any production do not pre- 
vent actions of the productions that follow it, taking into account the effects of interme- 
diate productions. 

Coherence is a concept that deals with potential applicability to a host graph of 
a sequence s„ of productions. It docs not guarantee that the application of s n and a 
coherent reordering of s n , a (s n ), lead to the same result. This latter case is a sort of 
generalization 13 of sequential independence applied to sequences, which will be studied 
in Chap. 6. 

Example. □ We extend previous example (see Fig. 4.3 on p. 77) with two more pro- 
ductions. Recall that our first production q\ deletes edge (5, 2), which starts in vertex 5 
and ends in vertex 2. As depicted in Fig. 4.4, production q 2 adds this edge and q 3 uses 
it. Sequence S3 = qz\q2] qi would be coherent if only this vertex was considered. ■ 

12 This issue will be raised again in Sec. 8.1. 

13 Generalization in the sense that, a priori, we are considering any kind of permutation. 
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Now we study the conditions that have to be satisfied by the matrices associated with 
a coherent sequence of productions. Instead of stating a result concerning conditions on 
coherence and proving it immediately afterwards, we begin by discussing the case of two 
productions in full detail, we continue with three and we finally set a theorem - theorem 
4.3.5 - for a finite number of them. 

Let us consider the concatenation S2 = P2 ; Pi ■ In order to decide whether the appli- 
cation of pi does not exclude p2, we impose three conditions on edges: 14 

1. The first production - p\ - does not delete any edge (ef ) used by the second pro- 
duction (Lf)' 

ef if = 0. (4.30) 

2. p 2 does not add (rf) any edge preserved (used but not deleted, efif ) by p\\ 

rf L f if = 0. (4.31) 

3. No common edges are added by both productions: 

rf rf = 0. (4.32) 

The first condition is needed because if p\ deletes an edge used by P2, then p2 would 
not be applicable. The last two conditions are mandatory in order to obtain a simple 
digraph (with at most one edge in each direction between two nodes). 
14 Note the similarities with weak sequential independence. See Sec. 3.2. 
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Conditions (4.31) and (4.32) are equivalent to rfiif = 0 because, as both are equal 
to zero, we can do 

0 = rf Lf if v rfrf = tf (rf v if Lf ) = rf i?f 

which may be read "p2 does not add any edge that comes out from p\ 's application" . All 
conditions can be synthesized in the following identity: 

r$R? v ef Lf = 0. (4.33) 

Our immediate target is to obtain a closed formula to represent these conditions 
for the case of an arbitrary finite number of productions. Applying (4.10) and (4.11), 
equation (4.33) can be transformed to get: 

flfifrf vLfef^f =0. (4.34) 
A similar reasoning gives the corresponding formula for nodes: 

R?7»r?vL?e»W = 0- (4.35) 
Remark. oNote that conditions (4.31) and (4.32) do not really apply to nodes as 
apply to edges. For example, if a node of type 1 is to be added and nodes 1 and 1' have 
already been appended, then by completion node 1" would be added. It is not possible 
to add a node that already exists. 

However, coherence looks for conditions that guarantee that the operations specified 
by the productions of a sequence do not interfere one with each other. Suppose the same 
example but this time, for some unknown reason, the node to be added is completed as 
V - this one has just been added -. If conditions of the kind of (4.31) and (4.32) are 
removed, then we would not detect that there is a potential problem if this sequence is 
applied. ■ 

Next we introduce a graphical notation for Boolean equations: A vertical arrow means 
and while a fork stands for or. We use these diagrams because formulas grow very fast 
with the number of nodes. As an example, the representation of equations (4.34) and 
(4.35) is shown in Fig. 4.5. 

Lemma 4.3.3 Let s 2 = P2',Pi be a sequence of productions. If equations (4-34) and 
(4-35) hold, then s 2 is coherent. 
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Ec F • Ec N 



Rf4 L?* R?4 L»\ 



E 



E 



i it Vl :t 



Fig. 4.5. Coherence for Two Productions 



Proof 

□Only edges are considered because a symmetrical reasoning sets the result for nodes. 
Call D the action of deleting an edge, A its addition and P its preservation, i.e. the 
edge appears in both LHS and RHS. Table 4.1 comprises all nine possibilities for two 
productions. 



D 2 ;D 1 


(4.30) 


D 2 ;Pi 


V 


D 2 -A x 


V 


P 2 ;D 1 


(4.30) 


P 2 ;Pi 


V 


P 2 ;Ai 


V 


A 2 -D x 


V 


A 2 ;Pi 


(4.31) 


A 2 ; Ai 


(4.32) 



Table 4.1. Possible Actions for Two Productions 



A tick means that the action is allowed, while a number refers to the condition that 
prohibits the action. For example, P2; D\ means that first production p\ deletes the edge 
and second P2 preserves it. If the table is looked up we find that this is forbidden by 
equation (4.30). ■ 

Now we proceed with three productions. We must check that P2 does not disturb 
P3 and that p\ does not prevent the application of p2- Notice that both of them are 
covered in our previous explanation (in the two productions case), and thus we just need 
to ensure that p\ does not exclude ps, taking into account that p2 is applied in between. 

1. pi does not delete (ef ) any edge used (if) by p% and not added (rf) by P2- 

efLff = 0. (4.36) 
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2. Production p^ does not add - rf - any edge stemming from p\ - Rf - and not 
deleted by P2 - ef -: 

rf i?f 71 = 0. (4.37) 

Again, the last condition is needed in order to obtain a simple digraph. Performing 
similar manipulations to those carried out for S2 we get the full condition for S3, given 
by the equation: 

if ef v Li (ef rf v ei) v Rf (if rf v rf ) v i?f rf = 0. (4.38) 
Proceeding as before, identity (4.38) is completed: 

Lfefrf vLfrf (efrf ve 2 £ ) v 

v flfif (rf v if rf ) v lif if rf = 0. (4.39) 
Its representation is shown in Fig. 4.6 for both nodes and edges. 




Fig. 4.6. Coherence Conditions for Three Productions 



Lemma 4.3.3 can be extended slightly to include three productions in an obvious way, 
but we will not discuss this further because the generalization to cover n productions is 
theorem 4.3.5. 

Example. □Recall productions qi, qi and q% introduced in Figs. 4.3 and 4.4 (on pp. 
77 and 81, respectively). Sequences (73; (72; Qi and qi; q^; 92 are coherent, while q^; q\\ qi is 
not. The latter is due to the fact that edge (5, 5) is deleted (D) by 92, used (U) by q\ and 
added (A) by 53, being two pairs of forbidden actions. For the former sequences we have 
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to check all actions performed on all edges and nodes by the productions in the order 
specified by the concatenation, verifying that they do not exclude each other. ■ 

Definition 4.3.4 Let F(x, y) and G(x, y) be two boolean functions dependant on para- 
meters x,y 6 I in some index set I. Operators delta A and nabla V are defined through 
the equations: 

A t t l(F(x,y))= V (/\(F(x,y))\ (4.40) 

y = t 0 \x=y / 

V $l(G( X ,y))= V (f\(G(x,y))\. (4.41) 

y=t 0 \x = t 0 J 

These operators will be useful for dealing with the general case of n productions. A 
simple interpretation for both operators will be given at the end of the section. 
Example. nLet F(x,y) = G(x,y) = r x e y , then we have: 

r x ey=\/ I f \ (r x e y ) = r 3 e 3 v r 3 r 2 e 2 v r 3 ^2^iei = e 3 v r 3 e 2 v r 3 r 2 ei. 

y=l \x=y J 
5 /x=y \ 

V3^ e sf=V ( A ^ Txe v) = F 3e 3 v r 3 r 4 e 4 v r 3 r 4 r 5 e 5 = e 3 v r 3 e 4 v r 3 r 4 e 5 . 

y=3 \a;=3 / 

Expressions have been simplified applying proposition 4.1.4.B 

Now we are ready to characterize coherent sequences of arbitrary finite length. 

Theorem 4.3.5 The concatenation s n = p n ;p n -i', ■ ■ ■ ',P2',Pi is coherent if for edges and 
nodes we have: 

n 

V { R f Vr + i (^r^) v Lf Ai" 1 (e y s ^)) = 0 (4.42) 

Z=l 

n 

V K Vr +1 (e»r?) v Lf A'" 1 = 0. (4.43) 

i—l 

Proof 

□Induction on the number of productions (see cases s 2 and s 3 studied above). I 

Figure 4.7 includes the graph representation of the formulas for coherence for s 4 = 

P4-,P3;P2;pi and s 5 = p 5 ;p 4 ;p3;P2;pi- 
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Fig. 4.7. Coherence. Four and Five Productions 



Example. oWe are going to verify that si = (71593; 92 is coherent (only for edges), 
where qi are the productions introduced in previous examples. Productions are drawn 
again in Fig. 4.8 for reader convenience. We start expanding formula (4.42) for n = 3: 

\J(R? V? + i ff<) vif Ar 1 (ef ^)) = flf ff rf vifif rf) v 

i=l 

v i?f ef rf v Lf ef v Lf ff^ef v ef ) = 

= R? (rf v if rf ) v i?f rf v Lf ef v Lf (ef rf v ef ) . 

which should be zero. 

Note that this equation applies to concatenation s = g 3 ; g 2 ; <Zi and thus we have to 
map (1, 2, 3) >— > (2, 3, 1) to obtain 

R? (rf v ifrf ) v i?f rf v Lf ef v Lf (ef ^f v ef ) = 0. (4.44) 
(*) (**) (***) 

Before checking whether these expressions are zero or not, we have to complete the 
involved matrices. All calculations have been divided into three steps and, as they are 
operated with or, the result will not be null if one fails to be zero. 

Only the second term (**) is expanded, with ordering of nodes not specified for a 
matter of space. Nodes are sorted [2 3 5 1 4] both by columns and by rows, meaning for 
example that element (3,4) is an edge startig in node 5 and ending in node 1. 
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Fig. 4.8. Productions qi, q-z and qz (Rep.) 
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0 10 0 0 




0 0 0 1 0 




0 10 0 0 


0 0 0 0 0 




0 10 0 0 




1 0 0 0 0 




0 0 0 0 0 


10 10 0 




0 10 0 0 
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1 0 0 0 0 




0 0 10 0 


0 0 10 0 




0 0 0 0 0 




0 0 0 1 0 




0 0 0 0 0 


0 0 0 0 0 




0 0 0 0 0 




0 0 0 0 0 




0 0 0 0 0 



= 0, 



so the sequence is coherent 15 where, as usual, a matrix filled up with zeros is represented 
by 0. 

Now consider sequence S3 = qi\ q%; q\ where qi and qs have been swapped with respect 
to S3. The condition for its coherence is: 

Rf (rf v if tf ) v Rgtf v Lief v L? (ef 7f v ef ) = 0. (4.45) 
(*) '•**•' (***) 

If we focus just on the first term (*) in equation (4.45) 
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0 110 0 






0 0 10 0 
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1 0 0 0 0 




0 0 0 0 0 






0 0 10 0 




1110 1 




0 0 0 0 0 




0 0 0 0 0 
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0 0 0 0 0 




11111 




0 0 0 0 0 


/ 



we obtain a matrix filled up with zeros except in position (3,3) which corresponds to an 
edge that starts and ends in node 5. Ordering of nodes has been omitted again due to 
lack of space, but it is the same as above: [2 3 5 1 4]. 



It is also necessary to check that (*) = (•••)= 0. 
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We do not only realize that the sequence is not coherent, but in addition information 
on which node or edge may present problems when applied to an actual host graph is 
provided. ■ 

Note that a sequence not being coherent does not necessarily mean that the grammar 
is not well defined, but that we have to be especially careful when applying it to a host 
graph because it is mandatory for the match to identify all problematic parts in different 
places. 

This information could be used when actually finding the match; a possible strategy, 
if parallel matching for different productions is required, is to start with those elements 
which may present a problem. 16 

This section ends providing a simple interpretation of V and A, which in essence 
are a generalization of the structure of a sequence of productions. A sequence P2',Pi is 
a complex operation: To some potential digraph, one should start by deleting elements 
specified by ei, then add elements in ri, afterwards delete elements in e 2 and finally add 
elements in r 2 . Generalization means that this same structure can be applied but not 
limited to matrices e and r, i.e. there is an alternate sequence of "delete" and "add" 
operations with general expressions rather than just matrices e and r. For example, 
V? (e^R x v L y v r y ). 

Operators V and A represent ascending and descending sequences. For example, 
V \e x Ty 

= PiPi{r$) and A\e x r y = p 3 p2(n). In some detail: 

Vf e x r y = e\r\ v ei e 2 r 2 v e\ e 2 e 3 r 3 = 

= ri v eir 2 v ei e 2 r 3 = n v ei (r 2 v e 2 r 3 ) = pi (p 2 (r 3 )) . 

We will make good use of this interpretation in Chap. 5 to establish the equivalence 
between coherence plus compatibility of a derivation and finding its minimal and negative 
initial digraphs in the host graph and its negation, respectively. 

4.4 Minimal and Negative Initial Digraphs 

Matches find the left hand side of the production in the host graph (see Chap. 5) and, as 
side effect, relate and unrelate elements among productions. Recall from previous section 
16 The same remark applies to G- congruence, to be studied in Sec. 6.1. 
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that we may think of matching as a vertical identification of nodes - and hence edges - 
relating as a side effect elements, so to speak, horizontally (see Fig. 4.9). For example, if 
L\ and Li have each one a node of type 3 and m\ : L\ — > Go and mi : Li — » G\ match 
this node in the same place of Go and G\ (suppose it is not deleted by p\) then this node 
is horizontally related. In Subsec. 4.4.1 we will study in detail this sort of relations. 

Compatibility is determined by the result of applying a production to an initial graph 
and checking nodes and edges of the result. If we try to define compatibility for a concate- 
nation or its composition, we have to decide which is the initial graph, but as mentioned 
above we prefer not to begin our analysis of matches yet. These are our main motivations 
to introduce minimal and negative initial digraphs. 



Li — Ri L 2 — R2 L 3 — R 3 




Fig. 4.9. Example of Sequence 



Example. ^Consider productions u and v defined in Fig. 4.10. It is easy to see that v; u 
is coherent but not compatible. It seems a bit more difficult to define their composition 
v o u, as if they were applied to the same nodes, a dangling edge would be obtained. 
Although coherence itself does not guarantee applicability of a sequence, we will see that 
compatibility is sufficient (generalized to consider concatenations, not only graphs or 
single productions as in definitions 2.3.2 and 4.1.5).B 



• - ► #>2 

2 4 



• 2 

4 



R. 



Fig. 4.10. Non-Compatible Productions 
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Defining compatibility for a sequence of productions is not straightforward, nor its 
composition starting with a coherent concatenation, and it will be necessary to bring in 
the concept of minimal initial digraph. 

The example shows a problem that led us to consider not only productions, but also 
the context in which they are to be applied. In fact, the minimal context in which they 
can be applied. This is something we try to avoid as, for the moment, we want to study 
productions alone without host graphs and matches. 

Two possibilities are found in the literature (for the categorical approach) in order 
to define a match, depending whether DPO or SPO is followed (see Sees. 3.1 and 3.2 
or [23]). In the latter, deletion prevails so in the present example production v would 
delete edge (4,2). Our approximation to the match of a production is slightly different, 
considering it as an operator that acts on a space whose elements are productions (see 
Chap. 5). 17 

This situation might be overcome if we are able to define a minimal and unique 18 
"host graph" with enough elements to permit all operations of a given concatenation or 
composition of productions. We call it a minimal initial digraph. Note that we were able 
to give a definition of compatibility in 2.3.2 for a single production because it is clear 
(so obvious that we did not mention it) which one is the minimal initial digraph: Its left 
hand side. 

Any production demands elements to exist in the host graph in order to be applied. 
Also, some elements must not be present. We will touch on "forbidden" elements in 
Subsec. 4.4.2. Both are quite useful concepts because on one hand they allow us to 
ignore matching if staying at a grammar definition level is desired (to study its potential 
behaviour or to define concepts independently of the host graph), and on the other the 
applicability problem (see problem 1) can be characterized through them. We will return 
to these concepts once matching is introduced and characterized, in Sec. 5.3 and also in 
Chap. 7 when we define graph constraints and application conditions. 

17 In the SPO approach - see Sec. 3.2 - rewriting has as side effect the deletion of dangling 
edges. One important difference is that in our approach it is defined as an operator that 
enlarges the production or the sequence of productions by adding new ones. 

18 Unique once the concatenation has been completed. Minimal initial digraph makes horizontal 

identification of elements explicit. 
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4.4.1 Minimal Initial Digraph 

One graph is known which fulfills all demands of coherent sequence s n = p n ;...;pi - 
namely L = V™=i ^* ~ m ^ ne sense that it has enough elements to carry out all operations 
specified in the sequence. Graph & is not completed (each Li with respect to the rest). If 
there are coherence issues among all grammar rules, then probably all nodes in all LHS 
of the rules will be unrelated giving rise to the disjoint union of Li. If, on the contrary, 
there are no coherence problems at all, then we can identify across productions as many 
nodes of the same type in Lj as desired. 

Definition 4.4.1 (Minimal Initial Digraph) Let s n = p n ; . . . ; pi be a completed se- 
quence, a minimal initial digraph is a simple digraph which permits all operations of s n 
and does not contain any proper subgraph with the same property. 

This concept will be slightly generalized in Sec. 5.3, definition 5.3.1, in which we 
consider the set of all potential minimal initial digraphs for a given (not-completed) 
sequence and analyze its structure. In fact, £ is not a digraph but this initial digraph 
set. Through completion one actual digraph can be fixed. 

Theorem 4.4.2 Given a completed coherent sequence of productions s n = p n ;...;pi, 
the minimal initial digraph is defined by the equation: 

M n = V? feLy) . (4.46) 

Superscripts are omitted to make formulas easier to read (i.e. they apply to both nodes 
and edges). In Fig. 4.14 on p. 97, formula (4.46) and its negation (4.57) are expanded for 
three productions. 
Proof 

□To properly demonstrate this theorem we have to prove that M n has enough edges 
and nodes to apply all productions in the specified order, that it is minimal and finally 
that it is unique (up to isomorphisms). We will proceed by induction on the number of 
productions. 

By hypothesis we know that the concatenation is coherent and thus the application 
of one production does not exclude the ones coming after it. In order to see that there 
are sufficient nodes and edges, it is enough to check that s n (V™=i ^*) = s ™ {M n ), as the 
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most complete digraph to start with is L = Vi*=i which has enough elements due to 
coherence. 19 

If we had a sequence consisting of only one production si = pi, then it should be 
obvious that the minimal digraph needed to apply the concatenation is L\. 

In the case of a sequence of two productions, say s 2 = P2',Pi, what p\ uses (Li) is 
again needed. All edges that p 2 uses (L 2 ), except those added (ri) by the first production, 
are also mandatory. Note that the elements added (ri) by pi are not considered in the 
minimal initial digraph. If an clement is preserved (used and not erased, e.\ Li) by pi, 
then it should not be taken into account: 

Li v L 2 ri (el-^i) = Li v L 2 t\ (ei v Li) =Ii v £ 2 i?i. (4.47) 

This formula can be paraphrased as "elements used by p\ plus those needed by p 2 's left 
hand side, except the ones resulting frompi's application". It provides enough elements 
to s 2 : 

p 2 ; pi {Li v L 2 Ri) = r 2 v ei (ri v el (L x v L 2 Ri)) = 

= r 2 v ei v riRiL 2 v e!i?iL 2 ) = 

= r 2 v ei v rii 2 v el£ 2 ) = 

= r 2 v ei (n v el (Li v i 2 )) = p 2 ;pi (Li v L 2 ) . 

Let's move one step forward with the sequence of three productions S3 = P3',p 2 ;pi- 
The minimal digraph needs what s 2 needed (Li v L 2 Ri), but even more so. We have 
to add what the third production uses (£3), except what comes out from p\ and is not 
deleted by production p 2 (i?i e 2 ), and finally remove what comes out (i? 2 ) from p 2 : 

M 3 = Liv L 2 7h v L 3 (ei-Ri)i?2 = L\ v L 2 i?T v L 3 i?2" (e 2 v i^) . (4.48) 

Similarly to what has already been done for s 2 , we check that the minimal initial 
digraph has enough elements such that it is possible to apply pi, p 2 and p 3 : 



Recall that £ is not completed so it somehow represents some digraph with enough elements 
to apply s n to. This is not necessarily the maximal initial digraph as introduced in Sec. 5.3. 
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p 3 ;P2;pi (M 3 ) = 

= r 3 v^(r 2 v^(rivei (ii v L2-R1 v L3-R2 (e2 v #1)))) 

-(- - ~ — " 

= ^3 v e 3 r 2 v e 2 eiL 2 v e\e 2 L 3 R 2 v Ri v L 3 e\RiR 2 



V3 v e 3 



e 2 ri v e 2 eiLi ve 2 eii 2 v r 2 v L 3 ei e 2 r 2 L 2 



V 



= 7-3 v ei (r 2 v ei (n v eT (Li v L 2 v £3))) = 
= Vz\V2,V\ (Li v L 2 v £ 3 ). 
The same reasoning applied to the case of four productions derives the equation: 



M 4 = Li v L 2 Ri v L 3 (e^Ri) R 2 v L^e^R^ (e^R 2 ) R 3 - 



(4.49) 



Minimality is inferred by construction, because for each Li all elements added by a 
previous production and not deleted by any production pj, j < i, are removed. If any 
other element is erased from the minimal initial digraph, then some production in s n 
would miss some element. 

Now we want to express previous formulas using operators V and A. The expression 



n 

Lf v\/[if Arfef) 

i=2 



(4.50) 



is close but we would be adding terms that include Rfef, and clearly Rfef # Rf , 
which is what we have in the minimal initial digraph. 20 Thus, considering the fact that 
ab vab = a (see Sec. 2.1) we eliminate them by performing or operations: 



(4.51) 



Thus we have a formula for the minimal initial digraph which is slightly different from 
that in the theorem: 



M„ 



Li v ei Vi 1 (R^L y+1 ) v \f [Li A\ 1 (R^e y )] 



(4.52) 



i=2 



20 Not in formula (4.46) but in expressions derived up to now for minimal initial digraph: 
formulas (4.47) and (4.48). 
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Please refer to Fig. 4.11 where, on the right side, expression (4.52) is represented 
while on the left the same equation, but simplified, is depicted for n = 4. 



Ec 



• 4l 

4 * 3 * • 



Ec 



4 R ' * 4 L2 4 " ] * 

R 



Fig. 4.11. Minimal Initial Digraph (Intermediate Expression). Four Productions 



Our next step is to show that previous identity is equivalent to: 



M„ 



Li v ei Vi 1 (rxLy+i) v \/ [Li A\ 1 (r x e v )] 



(4.53) 



illustrating the way to proceed for n = 3. To this end, equation (4.13) is used as well as 

the fact that a vab = a v b (see Sec. 2.1): 
M 3 = L\ v L 2 R\ v (e2 v R\) = 

= Li v L 2 rT ( e i v ^l) v (£3^262 v L^L^) (e 2 v rTeirlXi) = 
= Li v L 2 r r [L 1 v L 2 ei v L 3 e 2 v L 3 e 2 ei v L 3 e 2 ?TLi v L 3 e 2 L 2 v 

V v ' 

disappears due to L^e2 

v L 3 f 2 "L 2 rT^i v i 3 f 2 "L 2 ei = 
= Li v L 2 (fT v ei) v i 3 i 2 r 2 n" v L 3 e 2 v L^L 2 r2e\ = 
= Li v L 2 rT v L3F2 (e 2 v FT) . 
But (4.53) is what we have in the theorem, because as the concatenation is coherent, 
the third term in (4.53) is zero: 



(4.54) 



i=2 



This is precisely the second term in (4.42), the equation that characterizes coherence. 
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Finally, as L\ = L\ v e\, it is possible to omit e\ and obtain (4.46), recalling that 
rL = L (Prop. 4.1.4). 

Uniqueness can be proven by contradiction. Use equation (4.46) and induction on the 
number of productions. ■ 



*0 4 



• 2 

4 



R 



Fig. 4.12. Non-Compatible Productions (Rep. 



Example. nLet S2 = u; v and s' 2 = v;u (first introduced in Fig. 4.10 on p. 89 and 
reproduced in Fig. 4.12 for convenience of the reader). Minimal initial digraphs for these 
productions are represented in Fig. 4.13. 

The way we have introduced the concept of minimal initial digraph, Mi cannot be 
considered as such because either for sequence u; v or v; u there are subgraphs that permit 
their application. In the same figure the minimal initial digraphs for productions 93; 52; Qi 
and gi; 93; qi are also represented. Productions ^ can be found in Fig. 4.8. 




Fig. 4.13. Minimal Initial Digraph. Examples and Counterexample 



We will explicitly compute the minimal initial digraph for the concatenation 33; q\. 
In this example, and in order to illustrate some of the steps used to prove the previous 
theorem, formula (4.52) is used. Once simplified, it lays the equation: 
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L? v LfiZf v LiR? (e? v i?f ) . 



The ordering of nodes is [2 3 5 1 4]. We will only display the computation for (*), 
being (**) very similar: 



0 0 10 


1 
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1 


0 


0 0 




10 0 1 


1 
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0 0 0 0 


0 
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0 0 




10 11 
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0 


0 


10 10 


0 


V 
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0 


1 


0 0 




10 0 1 
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1 


0 


0 


0 0 0 0 


0 
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0 0 




1111 
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0 


0 
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0 0 0 0 


0 
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0 


0 


0 0 




1111 
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0 


0 


0 



(*) V (**) = 



0 0 10 1 
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0 0 0 1 0 
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0 0 0 0 0 
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0 0 0 0 0 
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0 


3 


10 10 0 
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1 0 0 0 0 
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0 


5 


0 0 0 0 0 


1 




0 0 0 1 0 


1 




0 


0 


0 


1 


0 


1 


0 0 0 0 0 


4 




0 0 0 0 0 


4 




0 


0 


0 


0 


0 


4 



Depicted on the center of Fig. 4.13.B 

A closed formula for the effect of the application of a coherent concatenation can be 
useful if we want to operate in the general case. This is where next corollary comes in. 

Corollary 4.4.3 Let s n = p n ;...;pi be a coherent concatenation of productions, and 
M n its minimal initial digraph as defined in (4-46)- Then, 

n 

s n (M*) = j\ (if ) v A™ (if rf ) (4.55) 

2=1 

n 

^AMe) = /\ (Wm£) v A? (if ) (4.56) 
i=i 

Proof 

□Theorem 4.4.2 proves that s n (M,f ) = s n (V"=i ^*)- To derive the formulas apply in- 
duction on the number of productions and (4.10).B 

Remark. ^Equation (4.56) will be uscfull in Sec. 4.5 to calculate the compatibility of 
a sequence. More interestingly, note that equation (4.55) has the same shape as a single 
production p = r veL, where: 

n 

s = Apf) 

i — 1 

r = A™ (if r*) . 
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However, in contrast to what happens with a single production, the order of appli- 
cation does matter, being necessary to carry out deletion first and addition afterwards. 
The first equation are those elements not deleted by any production and the second is 
what a grammar rule adds and no previous production deletes (previous with respect to 
the order of application). 

Equation (4.55) is closely related to composition of a sequence of productions as 
defined in Sec. 4.5, Prop. 4.5.3. This explains why it is possible to interpret a coherent 
sequence of productions as a single production. Recall that any sequence is coherent if 
the appropriate horizontal identifications are performed. I 



lEc 



y i a r. 



• V 



r,4 * 



J- ' 1 



L 2 H Li * r 2 



Fig. 4.14. Formulas (4.46) and (4.57) for Three Productions 

The negation of the minimal initial digraph which appears in identity (4.56) - seen 
in Fig. 4.14 - can be explicitly calculated in terms of nabla: 



M„ 



V™- 1 (L x r y ) v j\L t 



(4.57) 



For the sake of curiosity, if we used formula (4.53) to calculate the minimal initial 
digraph, the representation of its negation is included in Fig. 4.15 for n = 3 and n = 4. 
It might be useful to find an expression using operators A and V for these digraphs. 



4.4.2 Negative Initial Digraph 



Our plan now is to first make explicit all elements that should not be present in a 
potential match of the left hand side of a rule in a host graph, and then characterize 
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Fig. 4.15. Equation (4.53) for 3 and 4 Productions (Negation of MID) 

them for a finite sequence. This is carried out defining something similar to the minimal 
initial digraph, the negative initial digraph. In order to keep our philosophy of making 
our analysis as general as possible (independent of any concrete host graph) only the 
elements appearing on the LHS of the productions that make up the sequence plus its 
actions will be taken into account. 

We will refer to elements that should not be present as forbidden elements. There are 
two sets of elements that for different reasons should not appear in a potential initial 
digraph: 

1. Edges added by the production, as we are limited for now to simple digraphs. 

2. Edges incident to some node deleted by the production (dangling edges). 

To consider elements just described, the notation to represent productions is extended 
with a new graph ./V that we will call the nihilation matrix. 22 Note that the concept 
of grammar rule remains unaltered because we are just making explicit some implicit 
information. 

To further justify the naturality of this matrix let's oppose its meaning to that of the 
LHS and its interpretation as a positive application condition (the LHS must exist in the 

22 It will be normally represented by N. Just in case there can be any confusion, we will append 
to it some subscript, e.g. Nl for the left hand side, Nr for the right hand side, N p to make 
explicit some production or N s for some sequence. 
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host graph in order to apply the grammar rule). In effect, TV can be seen as a negative 
application condition: If it is found in the host graph then the production can not be 
applied. We will dedicate a whole chapter (Chap. 7) to develop these ideas. 23 

The order in which matrices are derived is enlarged to cope with the nihilation matrix 

N: 

(L, R) i — > (e, r) i — > N. (4.58) 

Otherwise stated, a production is statically determined by its left and right hand sides 
p = (L, R), from which it is possible to give a dynamic definition p = (L, e, r), to end up 
with a full specification including its environmental 24 behaviour p = (L, N, e, r). 

Definition 4.4.4 (Production - Dynamic Formulation) A production p is a mor- 
phism 25 between two simple digraphs L and R, and can be specified by the tuple 

p = (L E , N E , e E , r E ,L N , N N , e N , r N ) . (4.59) 

Compare with definition 4.1.1, the static formulation of production. As commented 
earlier in the book, it should be possible to consider nodes and edges together using the 
tensorial construction of Chap. 8. 

Next lemma shows how to calculate N using the production p, by applying it to a 
certain matrix: 

Lemma 4.4.5 (Nihilation matrix) Using tensor notation (see Sec. 2.4) let's define 
D = e N (x) (e^) , where t denotes transposition. Then, 

N E =p(D). (4.60) 

Proof 

□The following matrix specifies potential dangling edges incident to nodes appearing in 
the left hand side of p: 

23 In a negative application condition we will be allowed to add information of what elements 
must not be present. Probably it is more precise to speak of an implicit negative application 
condition. 

24 Environmental because N specifies some elements in the surroundings of L that should not 
exist. If the LHS has been completed - probably because it belongs to some sequence - then 
the nihilation matrix will consider those nodes too. 

25 In fact, a partial function since some elements in L do not have an image in R. 
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D 



if {e*f = 
otherwise. 



1 or (e 3 ) 



N 



(4.61) 



Note that D = e N (x) (e N ) . Every element incident to a node that is going to be 
deleted becomes dangling except edges deleted by the production. In addition, edges 
added by the rule can not be present, thus we have N E = r E v e E \D\ = p (-D)-B 
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Fig. 4.16. Example of Nihilation Matrix 



Example. □ We will calculate the elements appearing in lemma 4.4.5 for the produc- 
tion of Fig. 4.16: 









t 




0 




0 




1 1 1 


1 


® 


1 




1 0 0 


1 




1 




1 0 0 



The nihilation matrix is given by equation (4.60): 





0 0 0 




1 0 0 




1 1 1 




1 0 0 


N = r veD = 


0 10 


V 


1 1 0 




10 0 




1 1 0 




0 1 1 




1 1 1 




1 0 0 




1 1 1 



This matrix shows that node 1 can not have a self loop (it would become a dangling 
edge as it is not deleted by the production) but edges (1,2) and (1,3) may be present 
(in fact they must be present as they belong to L). Edge (2, 1) must not exist for the 
same reason. The self loop for node 2 can not be found because it is added by the rule. A 
similar reasoning tells us that no edge starting in node 3 can exist: The self loop and edge 
(3, 2) because they are going to be added and (3, 1) because it would become a dangling 
edge.B 
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Let us move on to sequences. The negative initial digraph N s for a coherent sequence 
s n = Pn] ■ ■ ■ '1P1 is the biggest simple digraph whose elements can not be found in the 
host graph to guarantee the applicability of s n . 26 It is the symmetric concept to minimal 
initial digraph, but for nihilation matrices. 

Definition 4.4.6 (Negative Initial Digraph) Let s n = p n ; . . . \p\ be a completed se- 
quence, a negative initial digraph is a simple digraph containing all elements that can 
spoil any of the operations of s n . 

Negative initial diagraphs depend on the way productions are completed (minimal 
initial digraphs too). In fact, as minimal and negative initial digraphs are normally cal- 
culated at the same time for a given sequence, there is a close relationship between them 
(in the sense that one conditions the other). This concept will be addressed again in Sec. 
5.3, together with minimal initial digraphs and initial sets. 

Theorem 4.4.7 Given a completed coherent sequence of productions s n = p n ;...;pi, 
the negative initial digraph is given by the equation: 

N s = V? (e x N y ) . (4.62) 

Proof (Sketch) 

□We can prove the result taking into account elements added by productions in the 
sequence but not dangling edges for now. 

Let's concentrate on what should not be found in the host graph assuming that what 
a production adds is not interfered by actions of previous productions. Note that this is 
coherence, assumed by hypothesis. Consider for example sequence s 2 = P2',Pi- Coherence 
detects those elements added by both productions {r\r 2 = 0) and also if p 2 adds what 
Pi uses but does not delete (r 2 eiL 2 = 0). 27 Hence, we may not care about them. In the 
proof of theorem 4.4.2, the final part precisely addresses this point. 

26 It is not possible to speak of applicability because we are not considering matches yet. This 
is just a way to intuitively introduce the concept. 

27 This is precisely the part of coherence (equation 4.42) not used in the proof of theorem 

4.4.2, the one for minimal initial digraphs: V™=i \^-f V?+i ( e f ''«')]■ Another reason for the 
naturality of N. 
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Now we proceed by induction. The case for one production p\ considers elements 
added by pi, i.e. r%. For two productions S2 = P2',Pi, besides what p\ rejects, what P2 
is going to add can not be found, except if pi deleted it: r\ v r 2 ei- Three productions 
■S3 = P3',p2',Pi should reject what S2 rejects and also what P3 adds and no previous 
production deletes: r\ v T2~e\ v r^^ei. We are using coherence here because the case in 
which pi deletes edge e and P2 adds edge e (we should have a problem if p$ also added 
e) is ruled out. By induction we finally obtain: 

V?=i (e x r y ) . (4.63) 

Now, instead of considering as forbidden only those elements to be appended by a 
production (not deleted by previous ones), any potential dangling edge 28 is also taken 
into account, i.e. r y can be substituted by N y (note that e a N a = N a ). A detailed proof 
may follow the steps of the demonstration of Theorem 4.4.2.B 




21 21 




Fig. 4.17. Productions qi, qi and qs (Rep.) 

Example. □Recall productions q\ (Fig. 4.3 on p. 77), q2 and (Fig. 4.4 on p. 81), 
reproduced in Fig. 4.17 for convenience of the reader. We will calculate the negative 

28 Of course edges incident to nodes considered in the productions. There is no information at 
this point on edges provided by other nodes that might be in the host graph (to distance one 
to a node that is going to be deleted). 
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initial digraph for sequence S3 = q 3l q2]q\. Its minimal initial digraph can be found in 
Fig. 4.13, on p. 95). 

Expanding equation (4.62) for S3 we get: 

N = Ni v eiN 2 v eie 2 N 3 . (4.64) 




N 9 




N 




Fig. 4.18. NID for s 3 = q 3 ;q 2 ;qi (Bold = Two Arrows) 



In Fig. 4.18 we have represented negative graphs for the productions (Ni) and graph 
N for s 3 . As there are quite a lot of arrows, if two nodes are connected in both directions 
then a single bold arrow is used. Adjacency matrices (ordered [2 4 5 3 1]) for first three 
graphs are: 



Ni 



0 0 0 1 0 
11111 
0 10 10 
0 10 10 
0 10 0 0 



No 



0 10 0 0 




0 0 0 1 0 


0 0 0 0 0 




11111 


1 0 0 0 0 


; N 3 = 


0 10 10 


1 0 0 0 0 




0 10 10 


0 0 0 0 0 




0 10 0 0 



The rest of matrices and calculations arc omitted for space considerations. ■ 

Matrix N provides information on what will be called internal e -productions in Sec. 
5.4. These e-productions are grammar rules automatically generated to deal with dangling 
edges. We will distinguish between internal and external, being internal (to the sequence) 
those that deal with edges added by a previous production. 
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4.5 Composition and Compatibility 

Next we are going to introduce compatibility for sequences (extending definition 4.1.5) 
and also composition. Composition defines a unique production that to a certain extent 29 
performs the same actions as its corresponding sequence (the one that defines it). 

Recall that compatibility is a means to deal with dangling edges, equivalent to the 
dangling condition in DPO. When a concatenation of productions is considered, we are 
not only concerned with the final result but also with intermediate states - partial results 
- of the sequence. Compatibility should take this into account and thus a concatenation 
is said to be compatible if the overall effect on its minimal initial digraph gives as result 
a compatible digraph starting from the first production and increasing the sequence until 
we get the full concatenation. We should then test compatibility for the growing sequence 
of concatenations S = {s\, S2, ■ ■ ■ , s n } where s m = q m ] q m -i', ■ ■ ■ Qi, 1 ^ m ^ n. 

Definition 4.5.1 A coherent sequence s n = q n : . . . ; qi is said to be compatible if the 
following identity is verified: 



Corollary 4.4.3 - equations (4.55) and (4.56) - give closed form formulas for the terms 
in (4.65). 

Of course this definition coincides with 4.1.5 for one production and with 2.3.2 for 
the case of a single graph (when considering the identity production, for example). 

Coherence examines whether actions specified by a sequence of productions are fea- 
sible. It warns us if one production adds or deletes an clement that should not, as some 
later production might need it to carry out an operation that becomes impossible. Com- 
patibility is a more basic concept because it examines if the result is a digraph, that is, 
if the class of all digraphs is closed under the operations specified by the sequence. 

Example. ^Consider sequence s 3 = q 3 ;q 2 ;qi, with qi as defined in Figs. 4.3 and 4.4. 
In order to check equation (4.65) we need the minimal initial digraphs Mi (the LHS of 

29 If a production inside a sequence deletes a node and afterwards another adds it, the overall 
effect is that the node is not touched. This may affect the deletion of dangling edges in an 
actual host graph (those incident to some node not appearing in the producitons) . 



n 



m—1 




(4.65) 
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M- 



s 2 {M 21 ) 



s 2 (M 2l )= 



0 0 10 1 

0 0 0 0 0 

10 10 0 

0 0 0 0 0 

0 0 0 0 0 

0 0 10 1 
110 0 0 

1 0 0 0 0 
0 0 0 0 0 
0 0 0 0 0 



Fig. 4.19. Minimal Initial Digraphs for S2 = <l2',qi 



<7i), M21 (coincides with the LHS of qi) and M321 that can be found in Figs. 4.19 and 
4.20 on p. 107. 

Equation (4.65) for m = 1 is compatibility of production q\ which has been calculated 
in the example of p. 77. For m = 2 we have 



[s 2 (Mi) v (s 2 (Mi)) t ] Q s 2 (Mii) 



(4.66) 



which should be zero with nodes ordered as before, [2 3 5 1 4]. The evolution of the vector 
of nodes is [1 0 1 0 1] 3* [1 1 1 0 0] A [1 1 1 0 1]. Making all substitutions according 
to values displayed in Fig. 4.19 we obtain: 



(4.66) 



As commented above, we can make use of identities (4.55) and (4.56). The case m = 3 
is very similar to m = 2. There is another example below (on p. 108) with the graphical 
evolution of the states of the system. ■ 

Once we have seen compatibility for a sequence the following corollary to theorems 
4.4.2 and 4.4.7 can be stated: 



/ 


00101 




0 110 0 






0 




0 


2 




11000 




0 10 0 0 






0 




0 


3 




10000 


V 


1 0 0 0 0 




0 


0 




0 


5 




00000 




0 0 0 0 0 






1 




0 


1 


V 


00000 




1 0 0 0 0 


J 




0 




0 


4 



Corollary 4.5.2 Let M be a minimal initial digraph and N the corresponding negative 
initial digraph for a coherent and compatible sequence, then M a N = 0. 
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Proof 

□Just compare equations M = V™ (r^L y ) and N = V™ (e x N y ). We know that elements 
added and deleted by a production are disjoint. This implies that the negation of the 
corresponding adjacency matrices have no common elements. ■ 

Intuitively, if we interpret matrices M and N as elements that must be and must not 
be present in a potential host graph in order to apply the sequence, then it should be 
clear that Lj and Ni must also be disjoint. This point will be addressed in Chap. 7. 

So far we have presented compatibility and the minimal initial digraph and will finish 
studying composition and the circumstances under which it is possible to define a single 
production starting with a coherent concatenation. 

When we introduced the notion of production, we first defined its LHS and RHS 
and then we associated some matrices (e and r) to them. The situation for defining 
composition is similar, but this time we first observe the overall effect (its dynamics, i.e. 
matrices e and r) of the production and then decide its left and right hand sides. 

Assume s n = p n ; . . . ;p± is coherent, then the composition of its productions is again 
a production defined by the rule c = p n o p n _i o...opj. 30 The description of its erasing 
and its addition matrices e and r are given by equations: 

n 

S E = I> [rf - ef ) (4.67) 

i = l 
n 

S N = Z W ~ e H ■ ( 4 - 68 ) 

i = l 

Due to coherence we know that elements of S E and S N are either +1, 0 or —1, so 
they can be split into their positive and negative parts, 

S E =r E_ e E j S N =r N_ e N^ (46Q) 

where all r + and e_ elements are either zero or one. We have: 

Proposition 4.5.3 Let s n = p n ;...;pi be a coherent and compatible concatenation of 
productions. Then, the composition c = p n o p„_i o . . . o p\ defines a production with 
matrices r E = r E , r N = and e E = — e E , e N = — e^ . 

30 The concept and notation are those commonly used in mathematics. 
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Proof 

□Follows from comments above. I 

The LHS is the minimal digraph necessary to carry out all operations specified by 
the composition (plus those preserved by the productions) . As it is only one production, 
its LHS equals its erasing matrix plus preserved elements and its right hand side is just 
the image. The concept of composition is closely related to the formula which outputs 
the image of a compatible and coherent sequence. Refer to corollary 4.4.3. 

Note that preserved elements do depend on the order of productions in the sequence. 
For example, sequence S3 = Pz',P2',Pi first preserves (appears in L\ and i?i) then deletes 
(P2) and finally adds (p^) element a. This element is necessary in order to apply S3. 
However, the permutation p' 3 = Pi\P\\P3 first adds a, then preserves it and finally deletes 
it. It cannot be applied if the element is present. 

Corollary 4.5.4 With the notation as above, c(M n ) = s n (M n ). 

Composition is helpful when we have a coherent concatenation and intermediate states 
are useless or undesired. It will be utilized in sequential independence and explicit par- 
allelism (see Sees. 6.2 and 6.4). 




Fig. 4.20. Composition and Concatenation of a non-Compatible Sequence 
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Example. □ We finish this section considering sequence S3 = q^q2]<ii again, cal- 
culating its composition C3 and comparing its result with that of S3. Recall that 



S E (* 3 )=5li (r 



1 



110 0 1 


2 


110 0 0 


3 


1110 0 


5 


0 0 0 0 0 


1 


0 0 0 0 0 


4 



2< 



0 10 0 1 


2 


1 0 0 0 0 


3 


10 10 0 


5 


0 0 0 1 0 


1 


0 0 0 0 0 


4 



1 0 


0 


0 


0 




1 0 0 0 0 




0 0 0 0 0 


0 1 


0 


0 


0 




0 10 0 0 




0 0 0 0 0 


0 1 


0 


0 


0 




0 10 0 0 




0 0 0 0 0 


0 0 


0 


-1 0 




0 0 0 0 0 




0 0 0 1 0 


0 0 


0 


0 


0 




0 0 0 0 0 




0 0 0 0 0 



S E (*s) = 



Sequence S3 has been chosen not only to illustrate composition, but also compatibility 
and the sort of problems that may arise if it is not fulfilled. In this case, (73 deletes node 
3 and edge (3,2) but does not specify anything about edges (3,3) and (3,5) - the red 
dotted elements in Fig. 4.20 -. In order to apply the composition, either the composed 
production is changed by considering these elements or elements have to be related in 
other way (in this case, unrelated). ■ 

Previous example provides us with some clues with the way the match could be de- 
fined. The basic idea is to introduce an operator over the set of productions, so once a 
match identifies a place in the host graph where the rule might be applied, the opera- 
tor modifies the rule enlarging the deletion matrix so that no dangling edge appear (it 
should enlarge the grammar rule to include the context of the original rule in the graph, 
adding all elements on both LHS and RHS). In essence, a match should be an injective 
morphism (in Matrix Graph Grammars) plus an operator. Pre-calculated information for 
coherence, sequentialization, and the like, should help and hopefully reduce the amount 
of calculations during runtime. We will study this in Chap. 5. 

This section ends noting that, in Matrix Graph Grammars, one production is a mor- 
phism between two simple digraphs and thus it may carry out just one action on each 
element. When the composition of a concatenation is performed we get a single produc- 
tion. Suppose one production specifies the deletion of an element and another its addition, 
the overall mathematical result of the composition should leave the element unaltered. 
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When a match is considered, depending on the chosen approach, all dangling edges in- 
cident to those erased nodes should be removed, establishing an important difference 
between a sequence and its composition. 

4.6 Summary and Conclusions 

In this chapter we have introduced two equivalent definitions of production, one empha- 
sizing the static part of grammar rules and the other stressing its dynamics. 

Also, completion has been addressed. To some extent it allows us to study productions, 
forgetting about the state to which the rule is to be applied. It provides a mean to relate 
elements in different graphs, a kind of horizontal identification of elements among the 
rules in a sequence. 

Sequences of productions have been introduced together with compatibility and co- 
herence. The first ensures that the underlying structure (simple digraph) is kept, i.e. it is 
closed under the operations defined in the sequence. Coherence guarantees that actions 
specified by one production do not disturb productions following it. 

Coherence can be compared with critical pairs, used in the categorical approach to 
graph grammars to detect conflicts between grammar rules. There arc differences though. 
The main one is that coherence in our approach covers any finite sequence of productions 
while critical pairs are limited to two productions. Among other things, coherence would 
be able to detect if a potential problem between two productions is actually fixed by 
some intermediate rule. 

Minimal and negative initial digraphs arc of fundamental importance, demanding the 
minimal (maximal) set of elements that must be found (must not be found) in order 
to apply the sequence under consideration. In particular they will be used to give one 
characterization of the applicability problem (problem 1). 

Also, composition and the main differences between this and concatenation have 
been addressed. Composition can be a useful tool to study concurrency. Recall from Sec. 
4.5 that differences in the image of the composition are not due to the order in which 
operations are performed but in those elements needed by the productions, i.e. in the 
initial digraph. This also gives information on initial digraphs and its calculation. This 
topic - which we call G-congruence - will be addressed in deeper detail in Sec 6.1. 
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So far we have developed some analytical techniques independent (to some extent) 
of the initial state of the system to which grammar rules are applied. This allows us to 
obtain information about grammar rules themselves, for example at design time. This 
information may be useful during runtime. We will return to this point in future chapters. 

Chapter 5 starts with the semantics of a grammar rule application, so a host graph 
or initial state will be considered. Among other things the fundamental concept of direct 
derivation is introduced. We will see what can be recovered of what we have developed 
so far and how it can be used. 



5 



Matching 



There are two fundamental parts in a grammar: Actions to be performed in every single 
step (grammar rules) and where these actions are to be performed in a system (matching). 
Previous chapter deals with the former and this chapter with the latter. Also, restrictions 
on the aplicability of rules and their embedding in the host graph need to be addressed. 
This topic is studied in Chap. 7. 

If a rule is applied we automatically have the pair (production, match) - normally 
called direct derivation - which in essence specifies what to do and where to do it. If 
instead of a single rule we consider a sequence with their corresponding matches then we 
will speak of derivation. These initial definitions, together with the matching are studied 
in Sec. 5.1 in which we will make use of some functional analysis notation (see Sec. 2.5). 
When a match is considered, there is the possibility that a new production (so called 
e- production) is concatenated to the original one. 1 Both productions must be applied 
(matched) to the same nodes. The mechanism to obtain this effect can be found in Sec. 
5.2 (marking). An important issue is to study to what extent the notions introduced at 
specification time (coherence, composition, etcetera) can be recovered when a host graph 
is considered. They will be revisited considering minimal and negative initial digraphs 
(see Sec. 4.4) in a wider context in Sec. 5.3. A classification of e-productions - helpful in 
Chap. 8 - is accomplished in Sec. 5.4. The chapter ends with a summary in Sec. 5.5. 

1 e-productions take care of those edges - dangling edges - not specified by the production and 
incident to some node that is going to be deleted. 
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5.1 Match and Extended Match 

Matching is the operation of identifying the LHS of a rule inside a host graph. This 
identification is not necessarily unique, becoming one source of non determinism. 2 The 
match can be considered as one of the ways of completing L with respect to G. 

Definition 5.1.1 (Match) Given a production p : L — » R and a simple digraph G, any 
tuple m = (tol,tojv) is called a match (for p in G), with ■ L — » G and ■ N E — > 
G E total injective morphisms. Besides, 



The two main differences with respect to matches as defined in the literature is that 
definition 5.1.1 demands the non-existence of potential problematic elements and that m 
must be injective. 

It is useful to consider the structure defined by the negation of the host graph, G = 
(G E ,G N ). It is made up of the graph G E and the vector of nodes G N . Note that the 
negation of a graph is not a graph because in general compatibility fails. Of course, the 
adjacency matrix alone ( G E ) does define a graph. 



The negation of a graph is equivalent to taking its complement. In general this com- 
plement will be taken inside some "bigger graph" , normally constructed by performing 
the completion with respect to other graphs involved in the operations. For example, 
when checking if graph A is in G E (suppose that A has a node that is not in G) wc 
obtain that A cannot be found in G E , unless G E is previously completed with that node 
and all its incident edges. 

Notice that the negation of a graph G coincides with its complement. Probably it 
should be more appropriate to keep the negation symbol (the overline) when there is no 
completion (in other words, complement is taken with respect to the graph itself) and 
use c when other graphs are involved. From now on the overline will be used in all cases. 
This notational abuse should not be confusing. 

2 In fact there are two sources of non-determinism. Apart from the one already mentioned, the 
rule to be applied is also chosen non-deterministically. 



fTi£,(n) = m,N(n), VneL 



(5.1) 
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Next, a notion of direct derivation that covers not only elements that must be present 
(L) but also those that should not appear (N) is presented. This extends the concept of 
derivation found in the literature, which only considers explicitly positive information. 



R 



G 



V 

■■>• H 



Fig. 5.1. Production Plus Match (Direct Derivation) 



Definition 5.1.2 (Direct Derivation) Given a production p : L — » R as in Fig. 5.1 
and a match m = (to£,totv)j d = (p,m) is called a direct derivation with result H = 
p* (G) if the square is a pushout: 

m* L op(L) =p*om L (L). (5.2) 

(p,m) 

The standard notation in this case is G =^> H , or even G =^> H if p, m or both are 
not relevant. 

We will see below that it is not necessary to rely on category theory to define direct 
derivations in Matrix Graph Grammars. It is included to ease comparison with DPO and 
SPO approaches. 

Figure 5.1 displays a production p and a match m for p in G. It is possible to close the 
diagram making it commutative (m* op = p* o m), using the pushout construction [22] 
on category Graph p of simple digraphs and partial functions. This categorical construc- 
tion for relational graph rewiting is carried out in [51]. See Sec. 3.6 for a quick overview 
on the relational approach. 3 

3 There is a slight difference, though, as we have a simpler case. We demand matchings to be 
injective which, by Prop. 2.6 in [51], implies that comatches are injective. 
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If a concatenation s = p n ; . . .\p\ is considered together with the set of matchings 
m = {mi, . . . , m„}, then d = (s, m) is a derivation. In this case the notation G ^=>* H 
is used. 

When applying a rule to a host graph, the main problem to concentrate on is that of 
so-called dangling edges, which is differently addressed in DPO and SPO (see Sees. 3.1 
and 3.2). In DPO, if one edge comes to be dangling then the rule is not applicable for 
that match. SPO allows the production to be applied by deleting any dangling edge. 

For Matrix Graph Grammars we propose an SPO-like behaviour as in our case a DPO- 
like behaviour 4 would be a particular case if compatibility is considered as an application 
condition (see Chap. 7). 5 




Fig. 5.2. (a) Neighbourhood, (b) Extended Match 

Figure 5.2 shows our strategy to handle dangling edges: 

1. Complete L with respect to G (c and c* on the left of Fig. 5.2). It is necessary to 
match L in G to this end. 6 

2. Morphism tul will identify rule's left hand side (after completion) in the host graph. 

4 In future sections we will speak of fixed and floating grammars, respectively. 

5 If e-productions are not allowed and a rule can be applied if the output is again a simple 
digraph (compatibility) then we obtain a DPO-like behaviour. 

6 Abusing a little of the notation, graphs before completion and after completion are represented 
with the same letter, L and R. 
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3. A neighbourhood of m(L) c G covering all relevant extra elements is selected taking 
into account all dangling edges not considered by match tul with their correspond- 
ing source and target nodes. This is performed by a morphism to be studied later, 
represented by m £ . 

4. Finally p is enlarged erasing any potential dangling edge. This is carried out by an 
operator that we will write as T E . See definition below. 

The order of previous steps is important as potential dangling elements must be 
identified and erased before any node is deleted by the original rule. 

The coproduct in Fig. 5.2 should be understood as a means to couple L and G. The 
existence of a morphism p* that closes the top square on the right of Fig. 5.2 is not 
guaranteed. This is where m e comes in. This mapping, as explained in point 2 above, 
extends the production to consider any edge to distance 1 from nodes appearing in the 
left hand side of p. The idea may resemble analytical continuation in complex variable, 
when a function defined in a smaller domain is uniquely extended to a larger one. 

Note that if it is possible to define p* (to close the square) then m e would be the 
identity, and viceversa. In other words, if there are no dangling edges then it is possible 
to make the top square in Fig. 5.1 commute and, hence, it is not necessary to carry out 
any production "continuation" . The converse is also true. 

r n m(L) s- m(L) 

\ V 

/' > / u m(L) 

Fig. 5.3. Match Plus Potential Dangling Edges 

Let be given a production p : L — » R, a host graph G and a match m : L — » G. 
The graph r is the set of dangling edges together with their source and target nodes. 
Abusing a little bit of notation (justified by the pushout construction in Fig. 5.3) we will 
write r u m(L) for the graph consisting of the image of L by the match plus its potential 
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dangling edges (and any incident node). Recall nihilation matrix definition, especially 
lemma 4.4.5. 

Definition 5.1.3 (Extended Match) With notation as above (refer also to Fig. 5.2), 
the extended match fh : L + G — > G is a morphism with image r u m (L) . 

As commented above, coproduct in Fig. 5.2 is used just for coupling L and G, being 
the first embedded into the second by morphism m^. We will use the notation 

L d = tuq (L) = (m e o m) (L) (5-3) 

when the image of the LHS is extended with its potential dangling edges, i.e. extended 
digraphs are underlined and defined by composing m and m e J 
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Fig. 5.4. Matching and Extended Match 



Example. ^Consider the digraph Li, the host graph G and the morphism match 
depicted on the left of Fig. 5.4. On the top right side in the same figure toi(Li) is drawn 
and m<3 (Li) on the bottom right side. Nodes 2 and 3 and edges (2,1) and (2,3) have 

7 There is a notational trick here, where "continuation" is represented as composition of mor- 
phisms (m_L o m e ). This is not correct unless, as explained in Sec. 4.2, matrices are completed. 
Recall that completion extends the domain of morphisms (intepreting matrices as morphisms 
between digraphs). This is precisely step 1 on p. 1. 
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been added to tog (L) which would become dangling in the image "graph" of G by p\ (as 
it can not be defined it has been drawn shadowed) . This is the reason why p* can not be 
defined, as node (1 : C) would be deleted but not edges (1 : C, 2 : C) nor (1 : C, 1 : S). 
H' would not be a digraph. 

As commented above, the composition is performed because toi and m £i i are functions 
between boolean matrices that have been completed. ■ 

Actually it is not necessary to rely on category theory to define direct derivations. 
The basic idea is given precisely by that of analytical continuation. What morphism m e 
does is to extend the left hand side of the production, i.e. it adds elements to L. As 
matches are total functions, they can not delete elements (nodes or edges) in contrast 
with productions. 

Hence, a match can be seen as a particular type of production with left hand side 
L and right hand side G. The LHS of the production is enlarged with any potential 
dangling edge and the same for the RHS except for edges incident to nodes deleted by 
the production (as they are not added to its RHS, these edges will be deleted). This way, 
a direct derivation would be 

H=p(m(L)). (5.4) 

Advancing some material from next section, m is essentially used to mark nodes in 
which p acts. Production p is the identity in almost all elements except in some nodes 
(edges) marked by to. 8 

The rest of the section is devoted to the interpretation of this "continuation technique" 
as a production, in particular that of m e . 

Once we are able to complete the rule's LHS we have to do the same with the rest of 
the rule. To this end we define an operator T e : (5 — > 25', where © is the original grammar 
and & is the grammar transformed once T e has modified the production. In words, T £ 
extends production p such that T £ (p) has the same effect than p but also deletes any 
dangling edge. 

The notation that we use from now on is borrowed from functional analysis (see Sec. 
2.5). Bringing this notation to graph grammar rules, a rule is written as R = (L,p) 

8 Note that p's erasing and addition matrices, although as big as the entire system state - 
probably huge -, are zero almost everywhere. 
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(separating the static and dynamic parts of the production) while the grammar rule 
transformation including matching is: 

R = (m G (L),T £ p). (5.5) 

Proposition 5.1.4 With notation as above, production p can be extended to consider 
any dangling edge, R = (mg (L) , T £ p). 

Proof 

□What we do is to split the identity operator in such a way that any problematic element 
is taken into account (erased) by the production. In some sense, we first add elements 
to p's LHS and afterwards enlarge p to delete them. Otherwise stated, m G = Xjr 1 and 
T* = tuq 1 , so we have: 

R = (L,p) = (L, (T- 1 o T £ ) p) = <m G (L) , T £ (p)> = R. 

The equality R = Ris valid only for edges as R N has source and target nodes of dangling 
edges. ■ 

The effect of a match can be interpreted as a new production concatenated to the 

def 

original production. Let p e = T* , 

R = <m G (L) , T £ (p)> = <T e * (m G (L)) ,p)= (5.6) 
= p (T* (ma (L))) = p ; p £ ; m G (L) = p; p e (L) . 

Production p e is the e-production associated to production p. Its aim is to delete 
potential dangling edges. The dynamic definition of p e is given in (5.7) and (5.8). 

The fact of taking the match into account can be interpreted as a temporary modifi- 
cation of the grammar, so it can be said that the grammar modifies the host graph and 
the host graph interacts with the grammar (altering it temporarily). 

If we think of m G and T* as productions respectively applied to L and ma (L), it 
is necessary to specify their erasing and addition matrices. To this end, recall matrix D 
defined in lemma 4.4.5, with elements in row i and column i equal to one if node i is to 
be erased by p and zero otherwise, which considers any potential dangling edge. 

For mc we have that e N = e E = 0, and r = LL (for both nodes and edges), 
as the production has to add the elements in L that are not present in L. Let p £ = 
( e T e ' r T e ' e T e ' r T e ) > then 
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e% = D a L E . 



(5.7) 
(5.8) 



Example.:: Consider rules depicted in Fig. 5.5, in which serverDown is applied to 
model a server failure. We have: 



L 



0 1 



L 



N 



.[.I. 



1 1 



R 



N 



0- 



Once ma = (L E , L N , r E , 0, 0, 0) and operator T E have been applied, giving rise to 
p £ = (L E ,L n , 0, 0, efi , 0), the resulting matrices are: 



0 0 0 




0 0 0 


10 0 


;L E = 


1 0 0 


1 0 0 




1 0 0 



0 0 
0 0 



E 



0 0 0 

1 0 0 
1 0 0 



where ordering of nodes is [1 : S, 1 : C, 2 : C] for matrices r E , L E and e E s and [1 : C, 2 : C] 
for R E . Matrix r E , besides edges added by the production, specifies those to be added by 
mc to the LHS in order to consider any potential dangling edge (in this case (1 : C, 1 : S) 
and (2 : C, 1 : S)). As neither ma nor production serverDown delete any element, e E = 0. 
Finally, p e removes all potential dangling edges (check out matrix ef? ) but it does not 
add any, so r E = 0. Vectors for nodes have been omitted. ■ 
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Fig. 5.5. Full Production and Application 
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Let T* = \T* , T* J be the adjoint operator of T e . We will end this section giving 
an explicit formula for T*. Dehne ef and rf respectively as the erasing and addition 
matrices of T e (p). It is clear that rf = r E = r E and ef = e E v D L E , so 

R E = (L E ,T E (p) > = rf v "ef L E = r E v (e E vDL E )L E = 

= r B v(flvP) = r B v 'e E DL E . (5.9) 

Previous identities show that i? E = (LF ,T E (p E )y = (D L E which proves the 
identity: 

T £ * = (T* N ,T* E> j = (id, D) . (5.10) 

Summarizing, when a match m is considered for a production p, the production 
itself is first modified in order to consider all potential dangling edges. Morphism m is 
automatically transformed into a match which is free from any dangling element and, in 
a second step, a pre-production p £ is appended to form the concatenation 9 

p*=p*;p*. (5.11) 

Note that as injectiveness of matches is demanded, there is no problem such as ele- 
ments identified by matches that on one hand are deleted and on the other are kept. 

Depending on the operator T £ , side effects are permitted (SPO-like behaviour) or 
not (DPO-likc behaviour). A fixed grammar or fixed Matrix Graph Grammar is one in 
which (mandatorily) the operator T e is the identity. If the operator is not forced to be 
the identity, we will speak of a floating grammar or floating Matrix Graph Grammar. 
Notice that the existence of side effects is equivalent to transforming a production into a 
sequence. This will also be the case when we deal with graph constraints and application 
conditions (Chap 7). 

5.2 Marking 

In previous section the problem of dangling edges has been addressed by adding an e- 
production which deletes any problematic edge, so the original rule can be applied as it is. 
9 It is also possible to define it as the composition: p* = p* op*. 
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However there is no way to guarantee that both productions will use the same elements 
(recall that in general matches are non-deterministic). The same problem exists with 
application conditions (Sec. 7.3) or whenever a rule is split into subrules and applying 
them to the same elements in the host graph is desired. 

This topic is studied in [70] (for a different reason) and the solution proposed there is 
to "pass" the match from one production to the other. We will tackle this problem in a 
different way that consists in defining an operator T M)Q for a label a acting on production 
p as follows: 

• If no node is typed a in p then a new node labeled a is added and connected to every 
already existing node. 

• If, on the contrary there exists a node of that type then it is deleted. 

The basic idea is to mark nodes and related productions with a node of type a. The 
operator behaves differently depending on whether it is marking the state (it adds node 
a) or it is extending the productions (a- typed nodes are removed). 

For an example of a short sequence of two productions, please refer to Fig. 5.6. Using 
functional analysis notation: 

R = (L,p) .— » R = <m e (L),T £ (p)> .— » R = <m £ (L),T M o T e (p)) (5.12) 

where, as in Sec. 5.1, R is the extended rule's RHS that considers any dangling edge. 

If a production is split into two subproductions, say p i — > T e (p) = p;p E and we want 
them to be applied in the same nodes of the host graph, we may proceed as follows: 

• Enlarge p £ to add one node of some non-existent type (a) together with edges starting 
in this node and ending in nodes used by p £ . 

• Enlarge p to delete a nodes mentioned in previous step. 

It is important to note that p must be enlarged to delete only the previously added 
node (a) and not the edges starting in a appended by to p e . The reason is that in case 
of a sequence in which the e-production is advanced several positions, there exists the 
possibility to create unreal dependencies between p and some production applied before 
p but after p e (the example below illustrates this point in particular) . 
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Marking will normally create new e-productions related to p. Note however that no 
recursive process should arise as there shouldn't be any interest in permuting (advancing) 
this new e-productions. 

For e-productions all this makes sense just in case we do not compose pop £ (no marking 
would be needed). Two different operators, one for a nodes addition and another for a 
nodes deletion (instead of just one) are not defined because marking always acts this 
way. This should not cause any confusion. 




Fig. 5.6. Example of Marking and Sequence s = p;p e 



Example. oFigure 5.6 illustrates the process for a simple production p that deletes 
node 1 and is applied to a host graph in which one or two dangling edges (depending on 
the match, 1 or 1') would be generated, (1,2) or (l',2) and (l',3). 

We have chosen node 1 for the match so there should be one dangling edge (1,2). In 
order to avoid it, an e-production p e which deletes (1, 2) is appended to p. 

The marking process modifies p £ and p becoming p e i— > X^(p £ ) and p >—> T fl (p), 
respectively. Note that T^(p) generates two dangling edges - (a, 1) and (a, 2) - so a new 
e-production p' £ oughts to be added. 

When the production is applied, a sequence is generated as operators act on the 
production - p >— > T £ (p) i— > o T £ (p) i— > T £ o T M o T £ (p) - giving rise to the following 
sequence of productions: 



V ' — >P\Pe ' — > ?) t (p);^;T A1 (p e ). 



(5.13) 
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The reason why it is important to specify only the new node deletion (a) and not the 
edges starting in this node is not diffcult but may be a bit subtle. It has been mentioned 
above. The rest of the example is devoted to explaining it. 

If we specified the edges also, say (a, 1) and (a, 2) as above, then the transformed 
production T M (p) would use node 2 as it should appear in its LHS and RHS (remember 
that p did not act on node 2). 

Now imagine that we are interested in advancing the £-production three positions, for 
example because we know that it is external (see Sec. 5.4) and independent: p ; p £ ; p2 ; pi *—> 
P;P2',PitPe- Suppose that production p\ (placed between p and the new allocation of p £ ) 
deletes node 2 and production p2 adds it. If p was sequential independent with respect 
to pi and P2 then it would not be anymore due to the edge ending in node 2 because 
now p uses node 2 (appears in its left and right hand sides). ■ 

Note that as the marking process can be easily automated, we can safely ignore it and 
assume that it is somehow being performed, by the runtime environment for example. 

5.3 Initial Digraph Set and Negative Digraph Set 

Concerning minimal and negative initial digraphs there may be different ways to complete 
rule matrices, depending on the matches. Therefore, we no longer have a unique initial 
digraph but a set (if we assume any possible match). In fact two sets, one for elements 
that must be found in the host graph and another for those that must be found in its 
complement. This section is closely related to Subsecs. 4.4.1 and 4.4.2 and extends results 
therein proved. 

The initial digraph set contains all graphs that can be potentially identified by 
matches in concrete host graphs. 

Definition 5.3.1 (Initial Digraph Set) Given sequence s n , its associated initial di- 
graph set 971 (s„) is the set of simple digraphs M t such that MMi e 9Jt(s„): 

1. Mi has enough nodes and edges for every production of the concatenation to be applied 
in the specified order. 

2. Mi has no proper subgraph with previous property (keeping identifications). 
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Every element Mj e Wl(s n ) is said to be an initial digraph for s n . It is easy to see 
that Vs„ finite sequence of productions we have 071 (s n ) # 0. 

In Sec. 4.3 coherence was used in a more or less absolute way when dealing with 
sequences, assuming some horizontal identification of elements. Now we see that, due 
to matching, coherence is a property that may depend on the given initial digraph so, 
depending on the context, it might be appropriate to say that s n is coherent with respect 
to initial digraph Mi (just in case direct derivations are considered). Note that what we 
fix by choosing an initial digraph is the relative matching of nodes across productions 
(one of the actions of completion). 

For the initial digraph set we can define the maximal initial digraph as the element 
M n e 971 (s„) that considers all nodes in pi to be different. This element is unique up to 
isomorphism, and corresponds to considering the parallel application of every production 
in the sequence, i.e. the LHS of every production in the sequence is matched in disjoint 
parts of the host graph. 

This concept has already been used although it was not explicitly mentioned: In the 
proof of theorem 4.4.2 we started with Vi*=i a digraph that had enough nodes to 
perform all actions specified by the sequence. 

In a similar way, Mj e 97t (s„) in which all possible identifications are performed are 
known as minimal initial digraphs. Contrary to maximal initial digraph, minimal initial 
digraphs need not be unique as the following example shows. 

Example. nln Fig. 5.7 we have represented the minimal digraph set for sequence s 
= removeChannel ; removeChannel. The production is also depicted in the figure where 
S stands for server and C for client. Note that it is not coherent if all nodes in L3 are 
identified because the link between two clients is deleted twice. Therefore, the initial 
digraphs should provide at least (in fact, at most) two different links between clients. 

In the figure we have associated colour orange to the maximal initial digraph and 
blue to two (absolute) minimal initial digraphs (of course they are all minimal for some 
appropriate identification of nodes). Identifications are written as i = j meaning that 
nodes i and j become one and the same. A top-bottom procedure has been followed, 
starting out with the biggest digraph M7 and ending in the smallest. Numbers on labels 
are all different to ease identifications on the initial digraph tree on the the right of Fig. 
5.7.B 
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Fig. 5.7. Initial Digraph Set for s=remove_channel;remove_channel 



It is possible to provide some structure 1 (s n ) to set 9Jt (s„) (see the right side of Fig. 
5.7). Every node in T represents an element of £Dt. A directed edge from one node to 
another stands for one operation of identification between corresponding nodes in LHS 
and RHS of productions of the sequence s n . 

Following with the example above, node M7 is the maximal initial digraph, as it only 
has outgoing edges. Nodes Mi and M3 are minimal as they only have ingoing edges. 
The structure T is an acyclic digraph with single root node (recall that there is just one 
maximal initial digraph), known as graph- structured stack. 

It is possible to make a similar construction for negative initial digraphs that we will 
call negative initial set. It will be represented by 01(s„) where s n is the sequence under 
study. 

Definition 5.3.2 (Negative Initial Set) Given sequence s n , its associated negative 
initial set 01 (s„) is the set of simple digraphs Ni such that ViV^ e 0T(s„): 

1. Ni specifies all edges that can potentially prevent the application of some production 
of s„ . 

2. Ni has no proper subgraph with previous property (keeping identifications) . 
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Fig. 5.8. Negative Digraph Set for s=clientDown; clientDown 

Example. nWe study the sequence s=clientDown; clientDown very similar to that 
in the example of p. 124 but deleting one node and two edges. It is depicted in Fig. 5.8 
and represents the failure of a client connected to a server and to another client. 

The same labeling criteria has been followed to ease comparison. Minimal digraphs are 
very similar to those in Fig. 5.7 and in fact identifications have been performed such that 
Ni corresponds to Mj. Graphs do not include all edges that should not appear because 
there would be many edges, probably being a confusing instead of a clarifying example. 
For instance, in N4 there can not be any edge incident to node 6 : C (except those coming 
from 1 : S and 4 : S), in particular edge (2 : C, 6 : C) which is not represented. Complete 
graph 7V4 can be found in Fig. 5.9. Note that for N4 the order of deletion is important, 
first node (2 : C) and then node (3 : C)M 
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Fig. 5.9. Complete Negative Initial Digraph N4 
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The relationship between elements in 9Jt and is compiled in corollary 4.5.2. Note 
that the cardinality of both sets do not necessarily coincide. In the example of p. 124, 
production s docs not add any edge nor deletes any node (hence, no forbidden element) 
so its negative digraph set is empty. 

Although in this book we are staying at a more theoretical level, we will make a small 
digression on application of these concepts and possible implementations. 

Let's take as an example the calculation of Mo in proposition 6.3.2, which states that 
two derivations d and d! are sequential independent if they have a common initial digraph 
for some identification of nodes, i.e. if 9Jl(d) n Wl(d') 0- We see that it is possible to 
follow two complementary approaches: 

• Top-bottom. Begin with the maximal initial digraph and start identifying elements 
until we get the desired initial digraph or eventually get a contradiction. 

• Bottom-up. Start with different initial digraphs and unrelate nodes until an answer 
is reached. 

In Fig. 5.7 on p. 125 cither we begin with M7 and start identifying nodes, eventually 
getting any element of the minimal initial set, or we start with Mi - which is not neces- 
sarily unique - and build up the whole set, or stop as soon as we get the desired minimal 
initial digraph. 

Let the matrix filled up with l's in all positions be represented by 1. For the first 
case the following identity may be of some help: 

M d = M d , M d M d , v M d M d , = 1. (5.14) 

A SAT solver can be used on (5.14) to obtain conditions, setting all elements in M 
as variables except those already known. In order to store M, binary decision diagrams 
- BDD - can be employed. Refer to [8]. 

The same alternative processes might be applied to the negative initial set to even- 
tually reach any of its elements. 

5.4 Internal and External ^-productions 

Dangling edges can be classified into two disjoint sets according to the place where they 
appear, whether they have been added by a previous production or not. 
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For example, given sequence P2IP1, suppose that rule p\ uses but does not delete edge 
(4, 1), that rule P2 specifies the deletion of node 1 and that we have identified both nodes 
1. It is mandatory to add one £-production p e ^ to the grammar with the disadvantage 
that there is an unavoidable problem of coherence between p\ and p e .i if we wanted to 
advance the application of p e _2 to pi, i.e. they are sequentially dependent. 

Hence, edges of e-productions are of two different types: 

• External. Any edge not appearing explicitly in the grammar rules, i.e. edges of the 
host graph "in the surroundings" of the actual initial digraph. 10 Examples are edges 
(1 : C, 1 : S) and (2 : C, 1 : S) in Fig. 5.5 on p. 119. 

• Internal. Any edge used or appended by a previous production in the concatenation. 
One example is edge (4, 1) mentioned above. 

e-productions can be classified in internal e-productions if any of its edges is internal 
and external e-productions otherwise. 

The "advantage" of internal over external e-productions is that the former can be 
considered (are known) during rule specification while external remain unknown until 
the production is applied. This, in turn, may spoil coherence, compatibility and other 
calculations performed during grammar definition. 

On the other hand, external e-productions do not interfere with grammar rules so 
they can be advanced to the beginning and even composed to get a single production if 
so desired (these are called exact derivations, defined below). 
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Example. nLet's consider derivation <i 2 = P2',Pi (see Fig. 5.10). Edge (1,2) in graph 
Gi is internal (it has been added by production pi) while edge (2, 3) in the same graph 
is external (it already existed in Gq)M 

Given a host graph G in which s n - coherent and compatible - is to be applied, 
and assuming a match which identifies s„'s actual initial digraph (M„) in G (defining 
a derivation d n out of s n ), we check whether for some in and T £ , which respectively 
represent all changes to be done to M n and all modifications to s n , it is correct to write 

H n = d n {M n ) = (m{M n )X{s n )), (5.15) 

where H n is the subgraph of the final state H corresponding to the image of M n . 

Equation (4.55) allows us to consider a concatenation almost as a production, justi- 
fying operators T e and in and our abuse of notation (recall that bra and kets apply to 
productions and not to sequences). 

All previous considerations together with the following example are compiled into the 
definition of exact sequence. 

Example. nLet s 2 = P2',Pi be a coherent and compatible concatenation. Using oper- 
ators we can write 

H = <m G , 2 «m G ,i (M 2 ),T ei i (pi)»,T e , 2 (p 2 )>, (5.16) 

which is equivalent to H = p 2 ; Pe&Pi', Pe,i ( M2 ) , with actual initial digraph twice mod- 
ified M2 = tog, 2 (tog,i (M 2 )) = (m Gt 2 o tog,i) (M 2 ). ■ 

Definition 5.4.1 (Exact Derivation) Let d n = (s n ,m n ) be a derivation with actual 
initial digraph M n , sequence s n = p n ; . . . ;p\, matches m„ = {tog,i, • • • ,TOG,n} o-nd e- 
productions {p e .i, ■ ■ ■ ,p £ ,n}- It is an exact derivation if there exist fh and T £ such that 
equation (5.15) is fulfilled. 

Equation (5.15) is satisfied if once all matches are calculated, the following identity 
holds: 

p n ;pe, n ; ■ ■ ■ ;pi;pe,i = p n ;--- ;pi;Pe, n ; ■ ■ -;Pe,i- (5-17) 

Proposition 5.4.2 With notation as in definition 5.4-1, ifpej-L (Pj-i\ ■ ■ ■ \Pi), Vj, then 
d n is exact. 



130 5 Matching 



Proof 

□Operator T e modifies the sequence adding a unique e-production, the composition of all 
e-productions p £i j. To see this, if one edge is to dangle, it should be eliminated by the 
corresponding e-production so no other e-production deletes it unless it is added by a 
subsequent production. But by hypothesis there is sequential independence of every p £ j 
with respect to all preceeding productions and hence p £ j does not delete any edge used 
by Pj-i, ■ ■ ■ ,Pi- In particular no edge added by any of these productions is erased. 

In definition 5.4.1, m is the extension of the match m which identifies the actual 
initial digraph in the host graph, so it adds to m (M n ) all nodes and edges to distance 
one to nodes that are going to be erased. A symmetrical reasoning to that of T E shows 
that rh is the composition of all mc,i- ■ 

With definition 5.4.1 and proposition 5.4.2 it is feasible to obtain a concatenation 
where all e-productions are applied first, and all grammar rules afterwards, recovering the 
original concatenation. Despite some obvious advantages, all dangling edges are deleted 
at the beginning which may be counterintuitive or even undesired if, for example, the 
deletion of a particular edge is used for synchronization purposes. 

The following corollary states that exactness can only be ruined by internal e- 
productions. 

Corollary 5.4.3 Let s n be a sequence to be applied to a host graph G and Mk 6 DJt (s n ). 
Assume there exists at least one match in G for Mk that does not add any internal 
e-production. Then, d n is exact. 

Proof (sketch) 

□All potential dangling elements are edges surrounding the actual initial digraph. It 
is thus possible to adapt the part of the host graph modified by the sequence at the 
beginning, so applying proposition 5.4.2 we get exacteness. ■ 

5.5 Summary and Conclusions 

In this chapter we have seen how it is possible to match the left hand side of a production 
in a given graph. We have not given a matching algorithm, but the construction of 
derivations out of productions. 
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There are two properties that we would like to highlight. The expressive power of 
Matrix Graph Grammars lies in between that of other approaches such as DPO and 
SPO: 

• We find it more intuitive and convenient to demand injectiveness on matches. This can 
be seen as a limitation on the semantics of the grammar but, on the other hand, not 
asking for injectiveness might present a serious problem. For example, when injectivity 
is necessary for some rules or non-injectivity is not allowed in some parts of the host 
graph. In a limit situation, it can be the case that several nodes and edges collapse 
to a single node and a single edge. 

• Rules can be applied even if they do not consider every edge that can appear in 
some given state. The grammar designer can concentrate on the algorithm at a more 
abstract level, without worrying about every single case in which a concrete rule needs 
to be applied. 11 

An advantage of e-productions over previous approaches to dangling edges is that 
they are erased by productions. This increases our analysis abilities as there are no side 
effects. 

We have also introduced marking, useful in many situations in which it is necessary 
to guarantee that some parts of two or more rules will be matched in the same area of 
the host graph. It will be used throughout the rest of the book. 

Initial and negative digraph sets are a generalization of minimal and negative initial 
digraphs in which some or all possible identifications are considered. Actually these 
concepts could have been introduced in Chap. 4, but we have postponed their study 
because we find it more natural to consider them once the matching has been introduced. 

We have classified the productions generated at run-time in internal and external. 
In fact, it would be more appropriate to speak of internal and external edges, but this 
classification suffices for our purposes. 

11 In cases of hundreds of rules, when every rule adds and deletes nodes and edges, it can be 
very difficult to keep track if some actions are still available. The canonical example would be 
a rule p that deletes some special node but can not be applied because some other production 
eventually added one incident edge that is not considered in the left hand side of p. 
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We are now in the position to characterize applicability, problem 1 stated on p. 7. 
In essence, applicability characterizes when a sequence is a derivation with respect to a 
given initial graph. 

Theorem 5.5.1 (Applicability Characterization) A sequence s n is applicable to G 
if there are matches for every production (define the derivation d n as the sequence s n 
plus these matches ) such that any of the two following equivalent conditions is fulfilled: 

• Derivation d n is coherent and compatible. 

• d n 's minimal initial digraph is in G and d n 's negative initial digraph is in G. 

Proof 
□■ 

Applicability will be used in Chap. 7 to characterize consistency of application con- 
ditions and graph constraints. 

In the next chapter sequentialization and parallelism are studied in detail. Problem 
3, sequential independence (stated on p. 8), will be addressed and, in doing so, we will 
touch on parallelism and related topics. 

Chapter 7 generalizes graph constraints and application conditions and adapts them 
for Matrix Graph Grammars. This step is not necessary but convenient to study reach- 
ability, problem 4 stated on p. 8, which will be carried out in Chap. 8. 
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Sequentialization and Parallelism 



In this chapter we will study in some detail problem 3 (sequential independence, p. 8) 
which is a particular case of problem 2 (independence, p. 8). Recall from Chap. 1 that 
two derivations d and d' are independent for a given state G if d(G) = H = H' = d'(G). 
We call them sequential independent if, besides, 3 a permutation such that d' = a(d). 

Applicability (problem 1) is one of the premises of independence, establishing an obvi- 
ous connection between them. In Chap. 8 we will sketch the relationship with reachability 
(problem 4) and conjecture one with confluence (problem 5) in Chap. 9. 

In Sec. 6.1 G-congruence is presented, which in essence poses conditions for two 
derivations (one permutation of the other) to have the same minimal and negative initial 
digraphs. The idea behind sequential independence is that changes of order in the position 
of productions inside a sequence do not alter the result of their application. This is 
addressed in Sec. 6.2 for sequences and in Sec. 6.3 for derivations. If a quick review of 
permutation groups notation is needed, please see Sec. 2.3. In Sec. 6.4 we will see that 
there is a close link between sequential independence and parallclization (see Church- 
Rosser theorems in, e.g. [11]). As in every chapter, we will close with a summary (Sec. 
6.5). 



134 6 Sequentialization and Parallelism 



6.1 G-Congruence 

Sameness of minimal and negative initial digraphs for two sequences - one a permutation 
of the other - or for two derivations if some matches have been given, will be known as 
G-congruence. This concept helps in characterizing sequential independence. 

Definition 6.1.1 (G-congruence) Two coherent sequences s n and a(s n ), where a is 
a permutation, are called G-congruent if they have the same minimal and negative initial 
digraphs, M Sn = M a(Sn) and N Sn = N a(Sn) . 

We will identify the conditions that must be fulfilled in order to guarantee equality of 
initial digraphs, first for productions advancement and then for delaying, starting with 
two productions, continuing with three and four to end up setting the theorem for the 
general case. 

The basic remark that justifies the way we tackle G-congruence is that a sequence 
and a permutation of it perform the same actions but in different order. Initial digraphs 
depend on actions and the order in which they are performed. The idea is to concentrate 
on how a change in the order of actions may affect initial digraphs. 

Suppose we have a coherent sequence made up of two productions s 2 = P2',Pi with 
minimal initial digraph M 2 and, applying the (only possible) permutation cr 2 , get another 
coherent concatenation s' 2 = VxWi with minimal initial digraph M 2 . Production pi docs 
not delete any element added by p 2 because, otherwise, if p\ in s 2 deleted something, 
it would mean that it already existed (as p\ is applied first in s 2 ) while p 2 adding that 
same element in s' 2 would mean that this element was not present (because p 2 is applied 
first in s' 2 ). This condition can be written: 



A similar reasoning states that p\ can not add any element that p 2 is going to use: 



eir 2 = 0. 



(6.1) 



riL 2 



0. 



(6.2) 



Analogously for p 2 against pi, i.e. for s' 2 



Pi;p 2 , we have: 



r 2 L\ 



e 2 ri 



0. 



0 



(6.3) 
(6.4) 
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As a matter of fact two equations are redundant - (6.1) and (6.3) - because they are 
already contained in the other two. Note that e^Li = ej, i.e. in some sense ei a Li, so it 
is enough to ask for: 

riL 2 v r 2 Li = 0. (6.5) 

It is easy to check that these conditions make minimal initial digraphs coincide, 
M 2 =M' 2 . In detail: 

M 2 = Mi v r\L 2 = L\ v f\L 2 v r\L 2 = ~L\ v L 2 
M 2 = M 2 v r 2 Li = Li v r 2 Li v r 2 Li = L 2 v L\. 

We will very briefly compare conditions for two productions with those of the SPO 
approach. In references [23; 24], sequential independence is defined and categorically 
characterized (see also Sees. 3.1 and 3.2, in particular equations 3.5 and 3.6). It is not 
difficult to translate those conditions to our matrix language: 

r x Li = 0 (6.6) 
e 2 Ri = e 2 ri v e 2 e\ L\ = 0. (6.7) 

First condition is (6.2) and, as mentioned above, first part of second condition 
(e 2 ri = 0) is already considered in (6.2). Second part of second equation {e<ie\L\ = 
0) is demanded for coherence, in fact something a bit stronger: e 2 Li = 0. Hence G- 
congruence plus coherence imply sequential independence in the SPO case, at least for a 
sequence of two productions. The converse does not hold in general. Our conditions are 
more demanding because we consider simple digraphs. 

Let's now turn to the negative initial digraph, for which the first production should 
not delete any element forbidden for p 2 (in such a case these elements would be in G for 
Pi;p 2 and in G for pi;pi): 

0 = eiN 2 = e\ri v eie 2 L> 2 . (6.8) 

Note that we already had e\r 2 = 0 in equation (6.1). A symmetrical reasoning yields 
e 2 eiDi = 0, and altogether: 

eie 2 D 2 v e 2 eiD 1 = 0. (6.9) 
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First monomial in (6.9) simply states that no potential dangling edge for p 2 (not 
deleted by p 2 ) can be deleted by p\. Equations (6.5) and (6.9) are schematically repre- 
sented in Fig. 6.1. 




ii 2 iu, L x m D,i 

Fig. 6.1. G-congruence for S2 = Vi\V\ 

It is straightforward to show that equation (6.9) guarantees the same negative initial 
digraph. In p 2 ;pi the negative initial digraph is given by Ni v exN-^. Condition (6.8) 
demands e\N% = 0 so we can or them to get: 

Ni v eiiV 2 v aN 2 = Ni v N 2 . (6.10) 

A similar reasoning applies to pi;p 2 , obtaining the same result. 

We will proceed with three productions so, following a consistent notation, we set 
s 3 = P3',P2',Pi, S3 = P2]Pi'iPz with permutation oz = [13 2] and their corresponding 
minimal initial digraphs M3 = L\ v f \ L2 v r\ r 2 L3 and M3 = L\ v r 2 L 2 v L3. 
Conditions are deduced similarly to the two productions case: 1 

r 3 Li = 0 r 3 L 2 r[ = 0 riL 3 = 0 r 2 L 3 e^ = 0. (6-H) 

Let's interpret them all. r 3 L\ = 0 says that p 3 cannot add an edge that p\ uses. This 
is because this would mean (by S3) that the edge is in the host graph (it is used by p\) 
but s 3 says that it is not there (it is going to be added by p 3 ). The second condition is 
almost equal but with p 2 in the role of p\ , which is why we demand p\ not to add the 

1 As far as we know, there is no rule of thumb to deduce the conditions for G-congruence. They 
depend on the operations that productions define and their relative order. 
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element (FT). Third equation is symmetrical with respect to the first. The fourth states 
that we would derive a contradiction if the second production adds something (r 2 ) that 
production p 3 uses (L 3 ) and pi does not delete (eT). This is because by S3 the element 
was not in the host graph. Note that s' 3 says the opposite, as p 3 (to be applied first) uses 
it. All can be put together in a single expression: 

L 3 (n v elr 2 ) v r 3 (Li v n L 2 ) = 0. (6-12) 

For the sake of completeness let's point out that there are other four conditions but 
they are already considered in (6.12): 

ei r 3 = 0 r 3 e 2 J\ = 0 e 3 ri = 0 r 2 e 3 el = 0. (6.13) 

Now we deal with those elements that must not be present. Four conditions similar 
to those for two productions - compare with (6.8) - are needed: 

eiiV 3 = eir 3 v e{e 3 D 3 = 0 
e 3 iVi = e 3 ri v e 3 e x Di = 0 
e 3 N 2 ei = e 3 r 2 ei v e 3 eie 2 D 2 = 0 

e 2 N 3 r 1 = e 2 r 3 T x v e 2 r{e 3 D 3 = 0. (6.14) 

Note that first monomial in every equation can be discarded as they are already 
considered in (6.12). We put them altogether to get: 

eie 3 D 3 v e 3 e 2 eiD 2 v e 3 e\Bi v e 2 e 3 ri_D 3 = 

= e 3 (ei-Di v e{e 2 D 2 ) v e 3 D 3 (ei v fie 2 ) . (6.15) 

In Fig. 6.2 there is a schematic representation of all G-congruence conditions for 
sequences s 3 = p 3 ;p 2 ;pi and s' 3 = p 2 ;p\;p 3 . These conditions guarantee sameness of 
minimal and negative initial digraphs, which will be proved below, in theorem 6.1.6. 

Moving one production three positions forward in a sequence of four productions, 
i.e. P4;P3',P2',Pi >-» P3',P2',Pi',Pi, while maintaining the minimal initial digraph has as 
associated conditions those given by the equation: 

2 Notice that by Prop. 4.1.4, equations (4.10) and (4.13) in particular, we can put YlLi instead 
of just Li and eTXi instead of just n. It will be useful in order to find a closed formula in 
terms of V. 
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4 • 

Fig. 6.2. G-congruence for Sequences S3 = P3\P2',Pi and s' 3 = P2;pv,ps 

L 4 (n v ei r 2 v ei e 2 r 3 ) v r 4 (ii v ?i L 2 v ri r 2 £3) = 0. (6.16) 

and the negative initial digraph by: 

e 4 (ei Di v ei e 2 D 2 v ei e 2 e 3 -D 3 ) v e 4 D4 (ei vn e 2 v 7\ r 2 e 3 ) = 0. (6-17) 

Equations (6.16) and (6.17) together give G-congruence for S4 and s' 4 are depicted on 
Fig. 6.3. 

Before moving to the general case, let's briefly introduce and put an example of a 
simple notation for cycles moving forward and backward a single production: 

1. Advance production n — 1 positions: 0„ = [1 n n — 1 ... 3 2]. 

2. Delay production n — 1 positions: 6 n = [ 1 2 ... n — 1 n]. 

Example. ^Consider advancing three positions the production p§ inside the sequence 
S5 = P5',P4;P3;P2;Pi to get (f> 4 (s s ) = P4,;P3;P2;Ps;Pl, where 04 = [l 4 3 2]. 

To illustrate the way in which we represent delaying a production, moving backwards 
production p 2 two places P5',Pi',Pz',P2',Pi ' — > P5] P2] Pa] P3] Pi has as associated cycle 
J4 = [2 3 4]. Note that numbers in the permutation refer to the place the production 
occupies in the sequence, numbering from left to right, and not to its subindex.B 

Conditions that must be fulfilled in order to maintain the minimal and negative initial 
digraphs will be called congruence conditions and will be abbreviated as CC, positive CC 
if they refer to minimal initial digraph and negative CC for the negative initial digraph. 
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?ii rl 4 

Fig. 6.4. G-congruence (Alternate Form) for S3 and s 3 



r 3 rxLi v L> 3 e 3 riei = F1L1 (r 3 v eie 3 D 3 ) = 

= T\L\ (r 3 ei v r 3 ei v eie 3 £> 3 ) = 

= rxLx (exN 3 v r 3 ei) = riLi7V 3 (ei v r 3 ) . (6.21) 

Last equality holds because -/V^r, = r,; v riDi = ri and a w ab = a v b. We have also 
used that Ni&i = Hi (r^ v e^Di) = iVj. The same sort of calculations for S4 and S4 are 
summarized in Fig. 6.5. 

• Ec E 

Fig. 6.5. G-congruence (Alternate Form) for S4 and s 4 
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A formula considering the positive (6.18) and the negative (6.19) parts can be derived 
by induction. It is presented as a proposition: 

Proposition 6.1.2 Positive and negative congruence conditions for sequences s n and 
s' n = ^n(sn) are given by: 

CC n ((f> n , s n ) = L n Vy _1 5xJV» (r y v e n ) v ^V™" 1 ^^ (e„ v r n ) . (6.22) 

Proof 
□■ 

G-congruence is obtained when CC n (</>„, s n ) = 0. An equivalent reasoning does it for 
production delaying n— 1 positions, giving very similar formulas. Suppose that production 
pi is moved backwards in concatenation s n to get s' n = pi;p n ', ■ ■ ■ ',P2, i-e. S n is applied. 
The positive part of the condition is: 

CC+ (S n , s n ) = LiV£ (eZr y ) v nV£ L v ) = 0 (6.23) 

and the negative part: 

CC n (S n ,s n ) = D 1 e 1 V%(r^e y ) v e^ (<%D V ) =0. (6.24) 

As in the positive case it is possible to merge equations (6.23) and (6.24) to get a 
single expression: 

Proposition 6.1.3 Positive and negative congruence conditions for sequences s n and 
s'n = S n (s n ) are given by: 

CC n (S n , s n ) = L^e x N y (r v v ei) v N^r x L v (e y v n) . (6.25) 

Proof 
□■ 

It is necessary to show that these conditions guarantee sameness of minimal and 
negative initial digraphs, but first we need a technical lemma that provides us with some 
identities used to transform the minimal initial digraphs. Advancement and delaying are 
very similar so only advancement is considered in the rest of the section. 
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Lemma 6.1.4 Suppose s n = p n ;...;pi and s' n = a (s n ) = p n -i', ■ ■ ■ ',Pi',Pn an d that 
CC+ (4> n ) is satisfied. Then the following identity may be or'ed to s n 's minimal initial 
digraph M n without changing it: 

DC+(<f> ni s n )=L n V r l- 2 (r^e v ). (6.26) 

Proof 

□Let's start with three productions. Recall that M3 = L\ v other ■J.erms and that L\ = 
Li v ei = Li v ei v eiL^ (last equality holds for any logical formula a v ab = a). Note 
that eiL 3 is (6.26) for n = 3. 

For n = 4, apart from eiL 4 , we need to get e-^FiL^ (because the full condition is 
DCX = L4 (ei vr 1 e2))- Recall again the minimal initial digraph for four productions 
whose first two terms are M 4 = L\ v rTL 2 . It is not necessary to consider all terms in 
Mi to get DC 4 + : 

M 4 = (Li v ei) v (n"L 2 v ne 2 ) v . . . = 

= (Li v ei v e 4 L 4 ) v (r 4 L 2 v rie 2 v rie 2 L 4 ) v . . . = 
= (Li v ei_L 4 ) v (rii 2 v rie 2 L 4 ) v . . . = 
= M 4 vDC+. 

The proof can be finished by induction. ■ 

Next lemma states a similar result for negative initial digraphs. We will need it to 
prove invariance of the negative initial digraph. 

Lemma 6.1.5 With notation as above and assuming that CC~ (<f> n ) is satisfied, the 
following identity may be or'ed to the negative initial digraph N without changing it: 

DC-{<t> n , s n ) = e„^„vr 2 (e^r„) . (6.27) 

Proof 

□We follow the same scheme as in the demonstration of lemma 6.1.4. Let's start with 
three productions. Recall that 7V 3 = Ni v other_terms and that Ni = Ni v r\ = N\ v 
ri v ri~e 3 D 3 . Note that r\esD 3 is (6.27) for n = 3. 

For n = 4, besides the term r 4 e 4 £> 4 we need to get eir 2 e 4 .D 4 (because DC^ = 
e 4 D 4 (r 4 v e 4 r 2 )). The first two terms of the negative initial digraph for four productions 
are iV 4 = N\ v eT^. Again, it is not necessary to consider the whole formula for iV 4 : 
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N4, = (Ni v ri) v (eiA^2 v r-iei) v . . . = 

= (iVi v r\ v rie^Di) v (eiA^ v eir2 v e1.r2e4.D4) v 
= (N\ v 7'ie4Z34) v (eiiV2 v e^e^D^ v . . . = 
= N 4 v £>C 4 -. 

The proof can be finished by induction. ■ 

r,l 

Fig. 6.6. Positive and Negative DC Conditions, DC£ and DC§ 



4 



Both, .DC^ 1- and DC§ are depicted in Fig. 6.6 for advancement of a single production 
s 5 = P5)p4jP3)P2;Pi 1 — > s 3 = P4iP3jP2;Pi;P5- Note the similarities with first and fourth 
branches of figure 6.3. 

Remark. nlf CC~ and DC~ are applied independently of CC+ and -DC+ then the 
expression 

DC n (</»„, s„) = 7V„Vr 2 (e^r„) (6.28) 

should be used instead of definition given by equation (6.27).B 

We are ready to formally state a characterization of G-congruence in terms of con- 
gruence conditions CC: 

Theorem 6.1.6 With notation as above, if s n and s' n = <j) n (s n ) are coherent and con- 
dition CC (<f> n ,s n ) is satisfied then they are G-congruent. 
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Proof 

□First, using CCf and DCf , we will prove Mj = M- for three and five productions. 
Identities a v ab = a v b and a v ab = a v b will be used in this demonstration. 

M 3 v CC 3 v -DC^ = [Li vfi£ 2 v Fi r 2 L 3 ] v [ri_L 3 v ei r 2 L 3 v r 3 Li v 
v fi r 3 L 2 ] v [eiL 3 ] = Li vriL 2 v f 1 r 2 L 3 v riL 3 v 
v fi r 2 L 3 v eii 3 = Li v fi L 2 v f 2 L 3 v r 2 L 3 v 
v L 3 (ri v ei) = Lx v fi L 2 v i 3 . 

In our first step, as neither r 3 Li nor fx r 3 i 2 are applied to M 3 , they have been 
omitted (for example, L\ v r 3 Lx = Lx). Once riL 3 , eiL 3 and r 2 L 3 have been used, they 
are omitted as well. 

Let's check out M3, where in second equality riL 3 and r 2 eiL 3 arc ruled out since 
they are not used: 

M 3 v CC 3 = [r 3 L x v fx r 3 i 2 v £ 3 ] v [riL 3 v r 2 ei L 3 v r 3 Li v fx r 3 L 2 ] = 
= f 3 Lx v fi f 3 L 2 v L 3 v r 3 Lx v n r 3 L 2 = 
= Ii v fi L 2 v L 3 . 

The case for five productions is almost equal to three productions but it is useful 
to illustrate in detail how CC§ and DC^ are used to prove that M5 = M' h in a more 
complex situation. The key point is the transformation fx r 2 r 3 r 4 L5 1 — > L5 and the 
following identities show the way to proceed: 

f 1 f 2 f 3 f 4 i 5 v rxL 5 = r 2 r 3 r 4 L 5 
f 2 r 3 r 4 i 5 v f 1 r 2 L 5 v eii 5 = r 3 r 4 L 5 
f 3 r 4 L 5 v fi ^ 2 r 3 L 5 veiL 5 v fie 2 i 5 v rii 5 = r 4 L 5 
f 4 L 5 v efi f 2 f 3 r 4 L 5 v eii 5 v fie 2 i 5 v riL 5 

v fi f 2 e 3 L 5 v tfxr 2 L 5 = L 5 . 

Note that we are in a kind of iterative process: What we get on the right of the 
equality is inserted and simplified on the left of the following one, until que get L 5 . For 
L4 the process is similar but shorter. 
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Now one example for the negative initial digraph is studied, A3 v CC 3 v DC 3 = 
v CC 3 -: 

A^ v CC3 v DC3 = [Ni veiN 2 v ei e 2 N3] v [e^Ni v e\ e 3 N 2 v eiiV 3 v 
v r% e2 A3] v [ri N3] = N\ v e\ N 2 v efi e2 -/V3 v ei v 
v fi e 2 N3 v riN$ = N± v e.\ N 2 v 1$ 2 N3 v e 2 N 3 v 
v N3 (n v ei) = iVj v ei JV2 v A^. 

N3 v CCg" = [e3 Ni v ei S3 A^ v A3] v [ei A3 v e 2 f% N 3 v e 3 Afi v e% ezN 2 ] 
= ?3 Ni v ei % A^2 v A3 v 63^1 v ei 63^2 = 
= iVi v ei JV 2 v A3. 

The procedure followed to show N3 = is completely analogous to that of A/3 = M' 3 M 

2 

4 5 * 



Fig. 6.7. Altered Production 53 Plus Productions qi and </2 

Remark □ Congruence conditions report what elements prevent G-congruence. In this 
way not only information of sameness of minimal and negative initial digraphs is avail- 
able but also what elements prevent G-congruence. For example, another way to see 
congruence conditions is as the difference of the minimal initial digraphs in the positive 
case.B 
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Example. nReusing productions introduced so far (gi, q 2 and q 3 ), 4 we are going to 
check G-congruence for a sequence of three productions in which one is directly delayed 
two positions, i.e. it is not delayed in two steps but just in one. As commented before, 
it is mandatory to change 173 in order to keep compatibility, so a new production q' 3 is 
introduced, depicted in Fig. 6.7. 

The minimal initial digraph for the sequece q' 3 ; q 2 ; q\ remains unaltered, i.e. M q >^ q2 . qi = 
M q3 .q 2m (compare with Fig. 4.20 on p. 107), but the one for q\\ q' 3 ; q 2 is slightly different 
and can be found in Fig. 6.8 along with the concatenation s' 123 = qi\q' 3) q2 and its 
intermediate states. 




Fig. 6.8. Composition and Concatenation. Three Productions 



In this example, production q\ is delayed two positions inside S3 = q' 3 ;q2', qi to obtain 
£>3 (S3) = Qi'iQsl <Z2> Such permutation can be expressed as #3 = [1 2 3]. 5 Only the positive 
case CC 3 (63,33) is illustrated. Formula (6.23) expanded and simplified is: 

4 In examples on pp. 77, 80, 95 and 108. 

5 Numbers 1, 2 and 3 in the permutation mean position inside the sequence, not production 
subindex. 
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Li (r 2 v e 2 r 3 ) v n (i 2 v r 2 Lg) . 



(6.29) 



(*} 



(**) 



If the minimal initial digraphs are equal, then equation (6.29) should be zero. Node 
ordering is [2 3 5 1 4], not included due to lack of space. 
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and the same for 
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/ 

We detect nonzero elements (1, 5) and (3, 1) in (*) and (1, 2), (2, 3) and (3, 2) in (**). 
They correspond to edges (2, 4), (5, 2), (2, 3), (3, 3) and (5, 3), respectively. Both minimal 
initial digraphs are depicted together in Fig. 6.9 to ease comparison. ■ 



2 1 2 1 

3 3 
5 T * 15 4 

Fig. 6.9. Example of Minimal Initial Digraphs 



Previous results not only detect if the application of a permutation (limited to ad- 
vancing or delaying a single element) let minimal initial digraphs unaltered, but also 
what elements are changed. 



148 6 Sequentialization and Parallelism 



6.2 Sequentialization - Grammar Rules 

In this section we will deal with position interchange inside a sequence of productions. 
For example, let S3 = ps]P2]Pi be a coherent sequence made up of three productions and 
suppose we wanted to move pz forward one position to obtain a (S3) = Vi',Vz\V\- This 
can be seen as a permutation a acting on S3's indices. 6 

Although we are not considering matches yet, there is a close relationship between 
this definition and problem 3 that we will explore in this and next sections. 

This section first introduces sequential independence for productions and a character- 
ization through G-congruence, compatibility and coherence. G-congruence and related 
conditions have been studied in Sec. 6.1. Similar results for coherence (advancement and 
delaying of a single production) are also derived. 

Definition 6.2.1 (Sequential Independence) Let s n = p n ; . . . ;pi be a sequence and 
g a permutation. Then, s n and a (s n ) are said to be sequential independent if both add 
and remove the same elements and have the same minimal and negative initial digraphs. 

Compatibility and coherence imply sequential independence provided s n and cr(s„) 
have the same minimal and initial digraphs. 

Theorem 6.2.2 With notation as above, if s n is compatible and coherent and a (s„) is 
compatible and coherent and both are G- congruent, then they are sequential independent. 

Proof 

□By hypothesis we can define two productions c s , c CT ( s ) which are respectively the com- 
positions coming from s n and ct(s„). Using commutativity of sum in formulas (4.67) and 
(4.68) - i.e. the order in which elements are added does not matter - we directly see that 
s„ and ct(s„) add and remove the same elements. G-congruence guarantees sameness of 
minimal and negative initial digraphs. ■ 

Note that, even though the final result is the same when moving sequential indepen- 
dent productions inside a given concatenation, intermediate states can be very different. 

In the rest of this section we will discuss permutations that move one production for- 
ward or backward a certain number of positions, yielding the same result. This means, 

6 Notation of permutation groups is summarized in Sec. 2.6 
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using theorem 6.2.2 and assuming compatibility and G-congruence, finding out the con- 
ditions to be satisfied such that starting with a coherent sequence we again obtain a 
coherent sequence after applying the permutation. 

Theorem 6.2.3 Consider coherent sequences t n = p a ', p n j Pn-i\ ■ ■ ■ \Pi\P\ an d s n = 
p n ;p n -i', ■ ■ ■ ',P2',Pi',Pp and permutations <p n+ i and S n +i- 

1. (f> n+ i (t n ) - advances p a application - is coherent if 

ef V? £?) v R E a V ? rf ) = 0. (6.30) 

2. S n+ i (s n ) - delays pp application - is coherent if 

Lf A? (jfef) v rf A™ (if Rf) = 0. (6.31) 

Proof 

□Both cases have a very similar demonstration so only production advancement is in- 
cluded. The way to proceed is to check differences between the original sequence t n and 
the swapped one, 4> n (t n ), discarding conditions already imposed by t n . 

We start with t<i = p a ]p2',Pi ' — > 4>3 (^2) = P2',Pi',Pa, where 4>z = [13 2]. Coherence 
of both sequences specify several conditions to be fulfilled, included in table 6.1. Note 
that conditions (t.1.7) and (t.1.10) can be found in the original sequence - (t.1.2) and 
(t.1.5) - so they can be disregarded. 



Coherence of p a ; P2 ; pi 


Coherence of P2',Pi;p a 


efL*=0 (t.1.1) 
efLf=0 (t.1.2) 
efL^ = 0 (t.1.3) 


efLf=0 (t.1.7) 
efl,f=0 (t.1.8) 
e£L?if =0 (t.1.9) 


r£R$ = 0 (t.1.4) 
r§Rf = 0 (t.1.5) 
rffifif = 0 (t.1.6) 


r$Rf = 0 (t.1.10) 
rf R% = 0 (t.1.11) 
rfi?f ef = 0 (t.1.12) 



Table 6.1. Coherence for Advancement of Two Productions 



We would like to express previous identities using operators delta (4.40) and nabla 
(4.41) for which equation (4.13) is used on (t.1.8) and (t.1.9): 



150 
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p E T E e _ 

e a Ll \ '1 — 



0 



(6.32) 




0. 



(6.33) 



For the same reason, applying (4.10) to conditions (t.1.11) and (t.1.12): 



rf ef Rl = 0 
rfifiJfif = 0. 



(6.34) 
(6.35) 



Condition (t.1.4) can be split into two parts - recall (4.31) and (4.32) - being r^rf = 0 
one of them. Doing the same operation on (t.1.12), r^rf ef = 0 is obtained, which is 
automatically verified and therefore should not be considered. It is not ruled out since, 
as stated above, we want to get formulas expressible using operators delta and nabla. 
Finally we get the equation: 



Performing similar manipulations on the sequence t 3 = p a ', P3', P2', Pi we get 04 (t 3 ) = 
P3: P2', Pi', Pa (with </>4 = [ 1 4 3 2 ]); we find out that the condition to be satisfied is: 



Figure 6.10 includes the associated graphs to previous example and to n = 4. The 
proof can be finished by induction. ■ 

Previous theorems foster the following notation: If (6.30) is satisfied and we have 
sequential independence, we will write p a _L (p n ; . . . ;pi) whereas if equation (6.31) is true 
and again they are sequential independent, it will be represented by (p n ; . . . ;p\) -Lpp- 
Note that if we have the coherent sequence made up of two productions P2',Pi and we 
have that Pi;p2 is coherent we can write P2-I-P1 to mean that either p 2 may be moved to 
the front or p\ to the back. 

Example. nit is not difficult to put an example of three productions ts = Wz;w2;wi 
where the advancement of the third production two positions to get t' 3 = W2',wi; w 3 
has the following properties: Their associated minimal initial digraphs - M and M', 




(6.36) 




(6.37) 
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Fig. 6.10. Advancement. Three and Five Productions 

respectively - coincide, they are both coherent (and thus sequential independent) but 
*3 = W2',W'i;wi can not be performed, so it is not possible to advance W3 one position 
and, right afterwards, another one, i.e. the advancement of two places must be carried 
out in a single step. 

• 1 w, • 1 • 1 w 2 • 1 • 1 w 3 • 1 

► ► ► 

*• 2 • 2 ! • 2 N» 2 f Ml 2 *• 2 

Fig. 6.11. Three Simple Productions 



As drawn in Fig. 6.11, w\ deletes edge (1,2), W2 adds it while it is preserved by W3 
(appears on its left hand side but it is not deleted). 

Using previous notation, this is an example where u^l. (u>2; wi) but W3<tw2- As far 
as we know, in SPO or DPO approaches, testing whether u>3_L (it^; wi) or not has to 
be performed in two steps: W3I.W2, that would allow for W3;w2;wi 1— > w;2; u>3; Wi, and 
W3I.W1 to get the desired result: 102; w\\ W3M 
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Fig. 6.12. Altered Production q' 3 Plus Productions qi and qi (Rep.) 



Example. □ We will use productions qi, q-z and q' 3 (reproduced again in Fig. 6.12). 
Production q' 3 is advanced two positions inside q' 3 ;q2',qi to obtain (72; qi] q' 3 - Such per- 
mutation can be expressed as 03 = [1 3 2]. 7 Formula (6.30) expanded, simplified and 
adapted for this case is: 



e 3 (£1 v riL 2 ) v R 3 (n v eir 2 ) 



(6.38) 



(*) 



(**) 



Finally, all elements arc substituted and the operations are performed, checking that 
the result is the null matrix. Node ordering is [2 3 5 1 4], not included due to lack of 
space. The first part (*) is zero: 



0 0 0 0 0 
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0 0 10 1 




10 111 




0 10 0 0 
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110 0 0 




0 0 0 0 0 




10 111 




0 0 0 0 0 


0 10 0 0 






10 10 0 
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10 111 




0 0 10 0 




0 0 0 1 0 






0 0 0 0 0 




11111 




0 0 0 0 0 




0 0 0 0 0 






0 0 0 0 0 




11111 




0 0 0 0 0 


/ 



and the same for (**) 



7 Numbers 1, 2 and 3 in the permutation mean position inside the sequence, not production 
subindex. 
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and hence the permutation is also coherent. ■ 

6.3 Sequential Independence — Derivations 

Sequential independence for derivations is very similar to sequences studied in previous 
section, the main difference being that there is a state now to be taken into account. 

Here a will represent an element of the group of permutations and derivation d n will 
have associated sequence s n . Note that two sequences s n and s' n = a(s n ) carry out the 
same operations but in different order. 

Definition 6.3.1 Two derivations d n and d' n = a (d n ) are sequential independent with 
respect to G if d n (G) = H n =s H' n = d' n (G). 

Compare with problem 3 on p. 8. Even though s' n = a(s n ), if e-productions appear 
because the same productions are matched to different places in the host graph, then it 
might not be true that d' n = o~(d n ). 

A restatement of definition 6.3.1 is the following proposition. The existence of an 
initial digraph guarantees coherence for both derivations. 

Proposition 6.3.2 If for two applicable derivations d n and d' n = a(d n ) 

1. 3M 0 c G such that 0 # M 0 e Wl(s n ) n Wl(s' n ) and 

2. the corresponding negative initial digraph Nq 6 9t(s„) n ^Sl{s' n ), 

then d n (Mo) and d' n (M 0 ) are sequential independent. 
Proof 

□Existence of a minimal initial digraph and its corresponding negative initial digraph 
guarantees coherence and compatibility. As it is the same in both cases, they are G- 
congruent. A derivation and any of its permutations carry out the same actions, but in 
different order. Hence, their result must be isomorphic. ■ 
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If two derivations (with underlying permuted sequences) are not a permutation of 
each other due to e-productions but are confluent (their image graphs are isomorphic), 
then in fact it is possible to write them as a permutation of each other: 

Proposition 6.3.3 If d n and d' n are sequential independent and s' n = u{s n ), then 
3 <7 | d' n = <j(d n ) for some appropriate composition of s -productions. 

Proof 

□Let T : p £ i— > T(p £ ) be an operator acting on e-productions, which splits them into a 
sequence of n productions each with one edge. 8 

If T is applied to d n and d' n we must get the same number of e-productions. Morover, 
the number must be the same for every type of edge or a contradiction can be derived 
as e-productions only delete elements. ■ 

Example. oDefine two productions p\ and p2, where p\ deletes edge (2,1) and p 2 
deletes node 1 and edge (1,3). Define sequences s 2 = P2\Pi and s' 2 = P\\P2 and apply 
them to graph G depicted in Fig. 6.13 to get H n and H' nl respectively. Note that p\ and 
Pi are not sequential independent in the sense of Sec. 6.2 with this identification. 



G Hi 2 2 H, 



2 Lr n 1 2 

m 

V 

2 3 T 



• 1 v 1# = !• p, 3# \ 1 1' 1 



3 3 " 3 

Fig. 6.13. Sequential Independence with Free Matching 



Suppose that in s' 2 the match m-2 for production p 2 identifies node 1. In this case an 
e-production p e .2 should appear deleting edge (2,1), transforming the concatenation to 
s 2 = Pi]P2]Pe,2 and making p\ inapplicable. If m 2 identifies node 1' instead of 1, then 
we have H n = H' n with the obvious isomorphism (1, 2, 3) i-» (1', 2, 3), getting in this case 
p 2 lpi- Note that M 0 (s' 2 ) eWl(s 2 ) n Tt(s' 2 ) (see Fig. 6.14). 

Neither sequence s 2 nor s 2 add any edge and only p 2 deletes one node. The negative 
digraph set has just one element that has been called A^, also depicted in Fig. 6.14.B 

8 More on operator T in Chap. 7. It is used in Sec. 7.3 for application conditions. 
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Fig. 6.14. Associated Minimal and Negative Initial Digraphs 



The theory developed so far fits well here. Results for sequential independence such 
as theorem 6.2.2, for coherence (theorems 4.3.5, 6.2.3 and 6.2.3) and for minimal and 
negative initial digraphs are recovered. 

Marking (see Sec. 5.2) can be used to freeze the place in which productions are applied. 
For example, if a production is advanced and we already know that there is sequential 
independence, any node identification across productions should be kept because if the 
production was applied at a different match sequential independence could be ruined. 



6.4 Explicit Parallelism 

This chapter finishes analyzing which productions or group of productions can be com- 
puted in parallel and what conditions guarantee this operation. Firstly we will take into 
account productions only, without initial state. 



G 




Fig. 6.15. Parallel Execution 
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In the categorical approach the definition for two productions is settled considering 
the two alternative sequential ways in which they can be composed, looking for equality in 
their final state. Intermediate states are disregarded using categorical coproduct of the 
involved productions (see Sec. 3.1). Then, the main difference between sequential and 
parallel execution is the existence of intermediate states in the former, as seen in Fig. 
6.15. We follow the same approach saying that it is possible to execute two productions 
in parallel if the result does not depend on generated intermediate states. 

Definition 6.4.1 Two productions pi and p 2 are said to be truly concurrent if it is 
possible to define their composition and it does not depend on the order: 

P2°Pi =Pi°P2- (6.39) 

We use the notation p\ \pi to denote true concurrency. True concurrency defines a 
symmetric relation so it does not matter whether p\ \pi or p 2 || p\ is written. 

Next proposition compares true concurrency and sequential independence for two pro- 
ductions, in the style of the parallelism theorem - see [11] -. 9 The proof is straightforward 
in our case and is not included. 

Proposition 6.4.2 Let s 2 =P2',Pi be a coherent concatenation and assume compatibil- 
ity, then: 

Pi II P2 P2-Lpi- (6.40) 

Proof 

□Assuming compatibility frees us from £-productions.B 

So far we have just considered one production per branch when parallelizing, as 
represented on the left of Fig. 6.16. One way to deal with more general schemes - center 
and right of the same figure - is to test parallelism for each element in one branch against 
every element in the other. 

Consider the scheme in the middle of Fig. 6.16. Sequences si = P6',P5',P4 and 
s 2 = P3',P2' : Pi can be computed in parallel if there is sequential independence for every 

9 However, in DPO it is possible to identify elements once the coproduct has been performed 
through non-injective matches. 
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interleaving. This is true if pi \\ pj, Vi e {4,5,6}, Vj 6 {1,2,3}. There are many com- 
binations that keep the relative order of Si and S2, for example P6',P3',P2',P5',Pi',P4: or 
P3] Pe] Pi\ Pb'i Pi: Pi- In order to apply these two sequences in parallel, all interleavings 
that maintain the relative order should have the same result. 



P3 



P2 



'Pi 



Pe;P5;P4 



P3-,P2\Pl 




W2; tui 



Fig. 6.16. Examples of Parallel Execution 



Although it is not true in general, in many cases it is not necessary to check true 
concurrency for every two productions. The following example illustrates the idea. 

Example. nLet be given the concatenation W4; w 3 ] W2; u>i; wo- Some of its productions 
are depicted in Fig. 6.11 on p. 151. Rule wi deletes one edge, W2 adds the same edge 
while w 3 preserves it. 

We already know that w 3 ; W2] w\ is compatible and coherent and that W3I (w2]Wi). 
Both have the same minimal initial digraph. Following our previous study for two pro- 
ductions we would like to put W3 and W2 ; w\ in parallel, as depicted on the right of Fig. 
6.16. 

From a sequential point of view this diagram can be interpreted in different ways, 
depending on how they are computed. There are three dissimilar interleavings: (1) 
w 3 ;w 2 ;w 1 , (2) w 2 ;wi;w 3 and (3) w 2 ;w 3 ;w 1 . 

Any problem involving the first two possibilities is ruled out by coherence. As a matter 
of fact, W3 and W2',wi can not be parallelized because it could be the case that w 3 is 
using edge (1,2) when wi has just deleted it and before w 2 adds it, which is what the 
third case expresses, leaving the system in an inconsistent state. Thus, we do not have 
W3 || w 2 nor W3 || wi - we do not have sequential independence - but both w 3 \W2\w\ 
and u>2] wi; w 3 are coherent. ■ 
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One possibility to proceed is to use the fact that although it could be the case that 
P3 k. P2, it still might be possible to advance the production with the help of another 
production, i.e. pz -L (f>2!Pi) as seen in Sees. 6.2 and 6.3. 

Although there are some similarities between this concept and the theorem of con- 
currency, 10 here we rely on the possibility to characterize production advancement or 
delaying inside sequences more than just one position, hence, being more general. 

Theorem 6.4.3 Let s n = p n ; . . . ;pi and t m = q m ; . . . ; q\ be two compatible and coherent 
sequences with the same minimal initial digraph, where either n = 1 or m = 1. Suppose 
r m +n = t m ',s n is compatible and coherent and either t m l.s n or s n lt m . Then, t m \\ s n 
through composition. 

Proof 

□Using proposition (6.4.2).B 

Through composition means that the concatenation with length greater than one must 
be transformed into a single production using composition. This is possible because it is 
coherent and compatible - refer to proposition 4.5.3 -. In fact it should not be necessary 
to transform the whole concatenation using composition, but only the parts that present 
a problem. 

Setting n = 1 corresponds to advancing a production in sequential independence, 
while m = 1 to moving a production backwards inside a concatenation. In addition, in 
the hypothesis we ask for coherence of r„ and either t m ]_s n or s m l_t n . In fact, if r m+n 
is coherent and t m _Ls„, then s n l_t m . It is also true that if r m+n is coherent and s n !i m , 
then t m _Ls„ (it could be proved by contradiction). 

The idea behind theorem 6.4.3 is to erase intermediate states through composition 
but, in a real system, this is not always possible or desirable if for example these states 
were used for synchronization of productions or states. All this section can be extended 
easily to consider derivations. 

10 See Sec. 3.1 or [22]. 
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6.5 Summary and Conclusions 

In this chapter we have studied in more detail sequences and derivations, paying special 
attention to sequential independence. We remark once more that certain properties of 
sequences can be gathered during grammar specification. This information can be used 
for an a-priori analysis of the graph transformation system (grammar if an initial state 
is also provided) or, if properly stored, during runtime. 

In essence, sequential independence corresponds to the concept of commutativity 
(a ; b = b ; a) or a generalization of it, because commutativity is defined for two elements 
and here we allow a or b to be sequences. It can be used to reduce the size of the state 
space associated to the grammar. From a theoretical or practical-theoretical point of view, 
sequential independence helps by reducing the amount of productions combinatorics in 
sequences or derivations. This is of interest, for example, for confluence (problem 5 on p. 
9). 

Besides sequential independence for concatenations and derivations, we have also 
studied G-congruence, which guarantees sameness of the minimal and negative initial 
digraphs, and explicit parallelism, useful for parallel computation. 

One of the objectives of the present book is to tackle problems 2 and 3, independence 
and sequential independence, respectively, defined in Sec. 1.2. The whole chapter is di- 
rected to this end, but with success in the restricted case of advancing or delaying a single 
production an arbitrary number of positions in a sequence. This is achieved in theorems 
6.2.2 and 6.2.3, which rely on theorem 6.1.6 (G-congruence), and also in propositions 
6.3.2 and 6.3.3. 

These results can be generalised by addressing other types of permutations such as 
advancing or delaying blocks of productions. Another possibility is to study the swap 
of two productions inside a sequence. It can be addressed following the same sort of 
development along this chapter. Swaps of two productions are 2-cycles and it is well 
known that any permutation is the product of 2-cycles. 

In order to link this chapter with the next one, which deals with application conditions 
and restrictions on graphs, let's note that conditions that need to be fulfilled in order to 
obtain sequential independence can be interpreted as graph constraints and application 
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conditions. Graph constraints and application conditions are important both from the 
theoretical and practical points of view. 

In Chap. 8 reachability - problem 4 - will be studied and extended from Petri nets 
to Matrix Graph Grammars. Petri nets will be interpreted as a proper subset of Matrix 
Graph Grammars. 



7 



Restrictions on Rules 



In this chapter graph constraints and application conditions - that we call restrictions 
- for Matrix Graph Grammars will be studied, generalizing previous approaches to this 
topic. For us, a restriction is just a condition to be fulfilled by some graph. 

In the literature there are two kinds of restrictions: Application conditions and graph 
constraints. Graph constraints express a global restriction on a graph while application 
conditions are normally thought as local properties, namely in the area where the match 
identifies the LHS of the grammar rule. By generalizing graph constraints and application 
conditions we will see that they can express both local and global properties, and further, 
that application conditions are a particular case of graph constraints. 

It is at times advisable to speak of properties rather than restrictions. For a given 
grammar, restrictions can be set either during rule application (application conditions, 
to be checked prior to rule execution or after it) or on the shape of the state (graph 
constraints, which can be set on the input state or on the output state). 

Application conditions are important from both practical and theoretical points of 
view. On the practical side, they are convenient to concisely express properties or to 
synthesize productions, despite the fact that they open the possibility to partially act on 
the nihilation matrix. On the theoretical side, application conditions provide us with a 
new perspective to look at the left and right hand sides of a production and enlarges the 
scope of Matrix Graph Grammars, including multidigraphs. 

This book extends previous approaches using monadic second order logic (MSOL, see 
Sec. 2.1 for a quick overview). Section 7.1 sets the basis for graph constraints and ap- 
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plication conditions by introducing diagrams and their semantics. In Sec. 7.2 derivations 
and diagrams are put together, showing that diagrams are a natural generalization of 
graphs L and N (in the precondition case). Section 7.3 expresses all these results using 
the functional notation introduced in Sec. 5.1 (see also Sec. 2.5), proving consistency of 
Matrix Graph Grammars with restrictions. Section 7.4 tackles the topic of transforming 
application conditions imposed to a rule's LHS into one equivalent but on rule's RHS. 
The converse, more natural from a practical point of view, is also addressed. Probably the 
main drawback of Matrix Graph Grammars as presented so far is the fact that they are 
limited to simple digraphs. Section 7.5 shows that multidigraphs can also be considered 
with no major modification of the theory, by using graph constraints. Section 7.6 closes 
the chapter with a summary and some more comments. 

7.1 Graph Constraints and Application Conditions 

A graph constraint in Matrix Graph Grammars is defined as a diagram (a set of graphs 
and partial morphisms) plus a MSOL formula. 1 The diagram specifies the relationship 
between elements of the graphs that make up the constraint and the formula specifies the 
conditions to be fulfilled in order to make the host graph G satisfy the graph constraint. 



d L() 




Fig. 7.1. Application Condition on a Rules's Left Hand Side 

Example. □ Figure 7.1 shows a diagram associated to the left hand side of a production 
p : L — > R matched to a host graph G by tul- An example of associated formula can be 
f = 3LVA 0 3A 1 [L (Ao^A^M 

1 MSOL corresponds to regular languages [12], which are appropriate to express patterns. 
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We will focus on logical expressions representing that one simple digraph is contained 
in another, because this is in essence what matching does. To this end, the following two 
formulas arc introduced: 



which rely on predicate F(e,X), "element e is in digraph X" . 

The domain of discourse are simple digraphs, and the diagram is a means to represent 
the interpretation function I (see Sec. 2.1 for definitions). Note that P(Xi,X 2 ) holds if 
and only if X\ <z X 2 and Q(X\,X 2 ) is true if and only if X\ n X2 # 0. Formula P 
will deal with total morphisms and Q with non-empty partial morphisms (see graph 
constraint fulfillment, definition 7.1.5). 

Remark. uP E ( X\, X 2 ) says that every edge 2 in graph X\ should also be present in 
I2, so a morphism d\ 2 : X\ — > X 2 is demanded. The diagram may already include one 
such morphism (which can be seen as restrictions imposed on function I) and we can 
either allow extensions of d 12 (relate more nodes if necessary) or keep it as defined in 
the diagram. This latter possibility will be represented appending the subscript U to 
P E h-> P E . Predicate P E can be expressed 3 using P E : 



P§{X 1 ,X 2 ) = Vo[- (F(a, D) v F(a, coD))] = P E (D, coD) a P e (D c , coD c ) (7.3) 



where D = Dom(di 2 ), coD = coDom(di 2 ), c stands for the complement and v is the 
xor operation. For example, following the notation in Fig. 7.4, P\j (Ai,A 0 ) would mean 
that it is not possible to further relate another element apart from 1 between A 0 and A\. 
This could only happen when A 0 and A\ are matched in the host graph. 

Pu will be used as a means to indicate that elements not related by their morphisms 
in the diagram must remain unrelated. These relationships (forbidden according to Pu) 
could be specified either by other morphisms in the diagram or by matches in the host 

2 Mind the superscript E in P E . As in previous chapters, an E superscript means edge and an 
N superindex stands for node. 

3 Non-extensible existence of dio for a graph constraint is Vx e Ao, Vy e Ai , 1714(1) = rriA^y) o 
y = dic^x), with notation as in Fig. 7.4. In words: When elements are matched in the host 
graph (or in other graphs through different dij), elements unrelated by dio remain unrelated. 



P(X U X 2 ) = Ve[F(e,Xi) ^ F(e,X 2 )] 
Q(X U X 2 ) = 3e[F(e,Xi) a F(e,X 2 )] 



(7.1) 
(7.2) 
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graph. For example, two unrelated nodes of the same type in different graphs of the 
diagram can be identified as the same node by the corresponding matches in the host 
graph. Hence, even though not explicitely specified, there would exist a morphism relating 
these nodes in the diagram. Pjj prevents this side effect of matches. The same can happen 
if there is a chain of morphisms in the diagram such as A® — > A\ — > A2- There might 
exist an implicit unspecified morphism Aq —> A2M 

It is interesting for restrictions to be able to express negative conditions, that is, to 
express that some elements should not be present in the host graph. By elements we 
mean nodes, edges or both. When some elements are requested not to exist in G, one 
possibility is to find them in the complementary graph. 

To this end we will define a structure G = that in first instance consists 

of the negation of the adjacency matrix of G and the negation of its vector of nodes. 
We speak of structure because the negation of a digraph is not a digraph. In general, 
compatibility fails for G. 4 

Although it has been commented already, we will insist in the difference between 
completion and negation of the adjacency matrix. The complement of a graph coincides 
with the negation of the adjacency matrix, but while negation is just the logical oper- 
ation, taking the complement means that a completion operation has been performed 
before. Hence, taking the complement of a matrix G is the negation with respect to some 
appropriate completion of G. As long as no confusion arises negation and complements 
will not be syntactically distinguished. Graph whith respect to which the completion (if 
any) is performed will not be explicetly written from now on. 




Fig. 7.2. Finding Complement and Negation 

4 In Chap. 4 a matrix for edges and a vector for nodes were introduced to differentiate one 
from the other, mainly because operations could be performed on nodes or on edges. Recall 
that compatibility related both of them and completion permitted operations on matrices of 
different size (with a different number of nodes). 
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Example. ^Suppose we have two graphs A and G as those depicted in Fig. 7.2 and 
that we want to check that A is not in G. Note that A is not contained in G (node 3 
does not even appear), but it does appear in the negation of the completion with respect 
to A of G (graph Ga in the same figure). ■ 

The notation (syntax) will be alleviated a bit more by making the host graph G 
the default second argument for predicates P and Q. Besides, it will be assumed that 
by default total morphisms are demanded. That is, predicate P will be assumed unless 
otherwise stated. Our proposal to simplify the notation is to omit G and P in these cases. 
Also, it is not necessary to repeat quantifiers that are together, e.g. V Aq3Ai3A2^ A3 can 
be abbreviated as yA 0 3AiA 2 yA3. 

Example. nA sophisticated way of demanding the existence of one graph 3^4[A] is: 

3A N 3A E [P (A N ,A E ) aA n a A E ] 

that reads it is possible to find in G the set of nodes of A and its set of edges in the same 
place - P (A N , A E ) -. In this case it is possible to use the universal quantifier instead, 
as there is a single occurrence of A N in A E up to isomorphisms: 

\/A N 3A E [P (A N ,A E ) a A N a A E ] . 

As another example, the following graph constraint is fulfilled if for every Aq in G it 
is possible to find a related A\ in G: 

VA 0 3A 1 [Aq => A{\ , (7.4) 

which by definition is equivalent to 

VA 0 3A 1 [P(A 0 ,G)^P(A U G)]. (7.5) 

These syntax simplications just try to simplify most commonly used rules. ■ 

Negations inside abbreviations must be applied to the corresponding predicate, e.g. 
3A [A] = 3A [P (A, G)] is not the negation of A's adjacency matrix. For the case of 
edges, the following identity is fulfilled: 



P E (A,G) = Q(A, G E ). 



(7.6) 
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The part that takes care of the nodes is easier, so from now on we will mainly con- 
centrate on edges and adjacency matrices. 5 

Definition 7.1.1 (Diagram) A diagram D is a set of simple digraphs and a set 

of partial injective morphisms {dk}keKjdk '■ A4 —> Aj, i,j 6 /. 

The term ground formula will mean a MSO closed formula which uses P and Q with 
constant nodes (i.e. nodes of a concrete type which can be matched with nodes of the 
same type). 

We will say that a diagram is well-defined if every cycle of morphisms commute and 
injectivity of dij is kept. For example, in the diagram of Fig. 7.3, node typed 2 has two 
different images, 2" and 2'", depending if morphism d\2 ° rfoi is considered or do2- 
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Fig. 7.3. non-Injective Morphisms in Application Condition 

Definition 7.1.2 (Graph Constraint) For a given simple digraph G (host graph), the 
tuple 

GC G = (0, f) = (({Ai}^, {d k } keK ) , f) (7.7) 

is a (constant) graph constraint if d is a diagram and f is a (ground) formula as defined 
above. 

How graph constraints can be expressed using diagrams and logic formulas will be 
illustrated with some examples 6 throughout this section, comparing with the way they 
should be written using FOL and MSOL. 

5 Using the tensor product it is possible to embed the node vector into the adjacency matrix. 
This has not been used in this book except in Chap. 8. See the definition of the incidence 
tensor in Sec. 8.3. 

6 Examples "at most two outgoing edges" below and "3-vertex colorable graph" on p. 170 have 
been adapted from [12]. 
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G 

Fig. 7.4. At Most Two Outgoing Edges 

Example (at most two outgoing edges). nLct's characterize graphs in which every 
node of type 1 has at most two outgoing edges. Using FOL: 

fi = Vyi,y2,V3[edg(l,y 1 ) a edg(l,y 2 ) a 

a edg (1, y 3 ) => y x = y 2 v y x = y 3 v y 2 = J/a] ( 7 -8) 

where function edg (x, y) is true if there exists an edge starting in node x and ending in 
y. In our case, we consider the diagram on the left of Fig. 7.4 together with the formula: 

fi = VAqMi [Ao => (Ai a P V {D, coD))} (7.9) 

where D = Dom(rfio) and coD = coDom(diQ). 

There must be two total injective morphisms rriA 0 ■ Aq — > G, ttia 1 ■ A\ — > G and a 
partial injective morphism m,A 1 A 0 '■ A x —> Aq which does not extend d\o (niA 1 A 0 = dm), 
i.e. elements of type 1 are related and variables y\ and y 2 remain unrelated with y 3 . 
Hence, two outgoing edges are allowed but not three. 

In this case it is also possible to consider the diagram on the right of Fig. 7.4 together 
with the much simpler formula ^ 2 = -^A^A^. This form will be used when the theory is 
extended to cope with multidigraphs in Sec. 7.5.B 

For now we will limit to ground formulas and it will not be until Sec. 7.5 that variable 
nodes are considered. A variable node is one whose type is not specified. 

A graph constraint is a limitation on the shape of a graph, i.e. what elements it is 
made up of. This is something that can always be demanded on any graph, irrespective 
of the existence of a grammar or rule. This is not the case for application conditions 
which need the presence of productions. 
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In the following few paragraphs, application conditions will be introduced. Out of the 
definition it is not difficult to see application conditions as a particular case of graph 
constraints in this framework. 

Definition 7.1.3 (Weak Precondition) Given a production p : L — > R with nihilation 
matrix N , a weak precondition is a graph constraint over G satisfying: 

1. 3\i,j such that Ai = L and Aj = N. 

2. f must demand the existence of L in G. 

3. f must demand the existence of N in G E . 

For technical reasons to be clarified in Sec. 7.4, it is better not to have morphisms 
whose codomains are L nor N, for example di : Ai — > L or dj : Aj — > N. This is not a big 
issue as we may always use their inverses due to di's injectiveness, i.e. one may consider 
d~ Y : L — > Ai and dj 1 : N — » Aj instead. 

Note the similarities between definition 7.1.3 and that of derivation in Sec. 5.1.2. Ac- 
tually, this definition interprets the left hand side of a production and its nihilation matrix 
as a weak precondition. Hence, any well defined production has a natural associated weak 
precondition. 

Starting with the definition of weak precondition we define weak postconditions sim- 
ilarly but using the comatch : R — > H, H = p(G). A precondition is a weak pre- 
condition plus a match mi : L — » G and, symmetrically, a postcondition is a weak 
postcondition plus a comatch tur : R — > H. 

Every production naturally specifies a weak postcondition. Elements that must be 
present are those found at R, while e v D should not be found by the comatch. 

Weak application conditions, weak preconditions and weak postconditions permit the 
specification of restrictions at a grammar definition stage with no need for matches, as 
in Chap. 4. 

Definition 7.1.4 ((Weak) Application Condition) For a production p, a (weak) 
application condition is a (weak) precondition plus a (weak) postcondition, AC = 
(AC L ,AC R ). 

Example. oFigure 7.5 depicts a production with diagram Qlhs = {A} for its LHS 
and diagram Z>rhs = {B} for its RHS. If the associated formula for Vlhs is \lhs = 
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Fig. 7.5. Example of Precondition Plus Postcondition 



3L3A [L ^4] then there are two different possibilities depending on how morphism cLa is 
defined: 

1. CLa identifies node 1 in L and A. Whenever L is matched in a host graph there can 
not be at least one A, i.e. at least for one matching of A - with node 1 in common 
with L - in the host graph either edge (1, 1) or edge (1,3) are missing. 

2. cLa does not identify node 1 in L and A. This does not necessarily mean that they 
must be different when matched in an actual host graph. Now, it is sufficient not to 
find one A which would be fine for any match of L in the host graph. 

Recall that the interpretation of the quantified parts 3L and 3A are, respectively, to 
find nodes 1 and 2 and 1 and 3 (edges too). In the first bullet above, both nodes 1 must 
coincide while in the second case they may coincide or they may be different. 

The story varies if formula \lhs = [LA] is considered. There are again two 

cases, but now: 

1. d,A identifies node 1 in L and A. No other node 3 can be linked to node 1 if it has a 
self loop. 

2. cIa does not identify node 1 in L and A. The same as above, but now both nodes 1 
need not be the same. 

A similar interpretation can be given to the postcondition djins together with formula 
Uhs = 1L3A [LA\ and f RHS = 3LV7L [LA\M 

Remark. □ As commented in the introduction of this chapter, graph constraints are 
normally thought as global conditions on the entire graph while application conditions 
are local properties, defined in the neighbourhood of the match (and usually not beyond). 

In our setting, the use of quantifiers on restrictions permit "local" graph constraints 
and "global" application conditions. The first by using existential quantifiers (so as soon 
as the restriction is fulfilled in one piece of the host graph, the graph constraint is 
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fulfilled) and the latter through universal quantifiers (everytime the application condition 
is matched it must be fulfilled). ■ 

Given a graph G and a graph constraint GC, the next step is to state when G fulfills 
GC. This definition also applies to application conditions. 

Definition 7.1.5 (Graph Constraint Fulfillment) A simple digraph G satisfies the 
graph constraint GC = (f , f), written G |= GC , if there exists a model for the following 
interpretation: 

1. I (P (Xi, Xj)) = m T : Xi — > Xj total injective morphism. 

2. I (Q (Xi,Xj j) = m p : Xi — > Xj partial injective morphism. 

where m T \r) = dk = iti p \d with dk : Xi — > Xj and D = Dom(dk)- It can be the case 
that Dom (m p ) n Dom (dk) = 0- 

Recall that we say that a morphism is total if its domain coincides with the initial 
set and partial if it is a proper subset. 

Remark. nThere can not exist a model if there is any contradiction in the definition of 
the graph constraint. A contradiction is to ask for an element to appear in G and also to 
be in G. In the case of an application condition, some contradictions are avoidable while 
others are not. We will return to this point in Sec. 7.2 with an example and appropriate 
definitions. ■ 

Bearing in mind that the only thing that makes a precondition (postcondition) special 
is that one match = uil {mAi = tj.r) is already fixed, it seems adequate to think 
that application conditions are fulfilled by graphs and not by matches, i.e. in contrast to 
the approach in [14], we do not write mi |= ACl but G \= ACl- 

More correctly, we should write (J, 0) |= f to mean that under interpretation given 
by I and assignments given by morphisms specified in 0, formula f is fulfilled (see Sec. 
2.1). In our opinion, the notation becomes a bit cumbersome and, in the end, what we 
try to express is that G is such that all restrictions can be accomplished. We do not see 
any inconsistency if it is properly interpreted. 

Example (3- vertex colorable graph). nln order to express that a graph G is 3- 
vertex colorable we need to state two basic facts: First, every single node belongs to one 
of three disjoint sets, called X\, X 2 and X 3 : Check first three lines in formula (7.10). 
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Fig. 7.6. Diagram for Three Vertex Colorable Graph Constraint 



Second, every two nodes joined by one edge must belong to different Xi, i = 1,2, 3, which 
is stated in the last two lines of (7.10). Using MSOL: 

fa = 3X U X 2 , X 3 [Vx (x e X x v x e X 2 v x e X 3 ) a 

Vx (V> (x,X 1 ,X 2 ,X 3 ) aiP(x,X 2 ,X 1 ,X 3 ) a 

1 p(x,X 3 ,X 2l X 1 )) a 
Vx, 2/ (eo?5 (x, y) a (x ^ y) => 0 (z, y, X{) a 

<t>{x,y,X 2 ) A<f>(x,y,X 3 ))] (7.10) 

where, 

tp(x,X,Y,Z) = [x e X ^ x $Y a x $ Z] 

4>{x,y,X) = [^(xeX a y e X)] = [x $ X vy$X]. 

In our case, we consider the diagram of Fig. 7.6 and formula 




(7.11) 



where A = (P(A X , Xi) mP(A x , X 2 ) vP(A x , -^3)). Digraphs Xi splits G into three disjoints 
subsets (the three colors) thorugh predicate A, which states the disjointness of Xi and, 
with the rest of the clause, the coverability of G, G = X\ [j X 2 [j X 3 M 
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For an example of a natural application condition that to the best of our knowledge 
can not be expressed using previous approaches to this topic, refer to the one on p. 180. 
More examples will be given in the rest of the chapter. 

7.2 Extending Derivations 

The question of whether our definition of direct derivation is powerful enough to deal 
with application conditions (from a semantical point of view) will be proved in theorem 
7.2.4 and corollary 7.2.5 in this section. It is necessary to check that direct derivations 
can be the codomain of the interpretation function, i.e. "MGG + AC = MGG" and 
"MGG + GC = MGG". 

Note that a direct derivation in essence corresponds to the formula: 



but additional application conditions (AC) may represent much more general properties, 
due to universal quantifiers and partial morphisms. Normally, for different reasons, other 
approaches to graph transformation do not care about elements that can not be present 
at a rule specification level. If so, a direct derivation would be as simple as: 



Another point is that of contradictions inside application conditions, briefly com- 
mented in previous section, which turns a rule into a logical fallacy preventing its appli- 
cation on any scenario. 

Definition 7.2.1 (Consistency, Coherence, Compatibility) Let AC = (f,f) be a 
weak application condition on rule p : L — > R, then: 

• AC is consistent if there exists some host graph G such that G |= AC. 

• AC is coherent if it is not a fallacy. 

• AC is compatible if it describes a simple digraph. 7 

7 If a graph of the AC demands the existence of some edge, it can not be incident to a node 
that is deleted by production p. 




(7.12) 



3L[L]. 



(7.13) 
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According to the set of morphisms {dij : A4 — * Aj}. 

The definitions for application conditions instead of their weak counterparts are al- 
most the same, except that consitency does not ask for the existence of some host graph 
but takes into account the one already considered. 

We will see in Sec. 7.3 (theorem 7.3.5 on p. 189) that there is a very close relationship 
between application conditions and sequences, being in fact possible to transform appli- 
cation conditions into sequences of productions. An application condition will be coherent 
if and only if its associated sequence is coherent and the same for compatibility (this is 
why these concepts have been named this way) . We will see that an application condition 
is consistent if its associated sequence is applicable. Here, morphisms play a similar role 
in the graphs that make up the application condition to completion in sequences of rules. 

Typically, coherence is not fulfilled if the condition simultaneously asks for the exis- 
tence and non-existence of some element. Non-compatibility deals with internal simple 
digraph structure inconsistencies. Examples of non-compatible and non-coherent appli- 
cation conditions follow. 
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Fig. 7.7. Avoidable non- Compatible Application Condition 



Example. □Non-compatibility can be avoided at times just rephrasing the condition 
and the rule. Consider the weak precondition A as represented on the left of Fig. 7.7. 
There is a diagram T) = {^4} with associated formula f = 3^4[A], being morphism 6^(1) = 
1. As the production deletes node 1 and the application condition asks for the existence 
of edge (1, 3), it is for sure that if the rule is applied we will obtain at least one dangling 
edge. 

The key point is that the condition asks for the existence of edge (1, 3) but the 
production demands its non-existence as it is included in the nihilation matrix N. In this 
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case, the rule p' depicted on the right of the same figure is completely equivalent to p 
but with no potential compatibility issues. 



( 4 
V« 3 

d 12 

m- i 

^•3 

Fig. 7.8. non-Coherent Application Condition 

A non-coherent application condition can be found in Fig. 7.8. Morphisms identify 
all nodes: d Ll ({l}) = {1} = d 12 ({l}), d Li ({2}) = {2}, d 12 ({3}) = {3} with formula 
f = 3LVAi3A 2 [L => Ax a P (A 2 ,G)]. There is no problem with edge (1,2) but with 
(1, 1) there is one. Note that due to Ax, it must appear in any potential host graph but 
A 2 says that it should not be present. ■ 

To include application conditions in the Matrix Graph Grammars framework as de- 
veloped so far, two operations on basic diagrams are introduced: Closure (£) and De- 
composition (X)). The first deals with universal quantifiers and the second with partial 
morphisms. In some sense they are complementary (compare equations (7.14) and (7.15)). 
Some notation for the set of morphisms and isomorphisms between two graphs is needed: 

Hom t (Ai, Aj) = {/ : Aj — > Aj | / is a morphism that keeps node types} 
Iso t (Ai, Aj) = {/ e Hom t (Ai, Aj) \ f is injective and surjective} 

In this book we only consider injective graph morphisms which keep types of nodes: 
Only nodes of the same type can be related by a graph morphism. That is the reason 
why we use the notation Horn 1 and Iso 1 instead of the more common Horn and Iso. In 
the rest of the section, AC will represent a weak application condition unless otherwise 
stated. 
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Definition 7.2.2 (Closure) Given AC = (t>, f) with diagram c) = {^4}, ground formula 
f = V^4[A] and a host graph G, the result of applying € to AC is calculated as follows: 



with A 1 = A, dij $ Iso t (A l , A 7 ), £ (AC) = AC = (0', f) and n the number of occurrences 
of A in G. 

The condition that morphism dij must not be an isomorphism means that at least 
one element of A 1 and A 1 will be identified in different places of G. There is an example 
right after the definition of the decomposition operator, on p. 177. 

The interpretation of the closure operator is that demanding the universal appearance 
of a graph is equivalent to the existence of all of its instances in the specified digraph 
(G, G or whatever). Whenever nodes in A are identified in G, edges of A must also be 
found. 

Some nodes can be the same for different identifications (dij), so the procedure does 
not take into account morphisms that identify every single node, dij 0 Iso* (A l ,A^. 
Therefore, each A 1 contains the image of a possible match of A in G (there are n possible 
occurrences of A in G) and d^ identifies elements considered equal. This could also have 
been specified in terms of Pjj. 

The idea behind decomposition is to split a graph into its components to transform 
partial morphisms into total morphisms of one of its parts. If nodes are considered as the 
building blocks of graphs for this purpose, then if two graphs share a node of the same 
type there would be a partial match between them, irrespective of the links established by 
the edges of the graphs. Also, as stated above, we are more interested in the behaviour of 
edges (which to some extent comprises nodes as source and target elements of the edges, 
except for isolated nodes) than on nodes alone as they define the topology of the graph. 
These are the reasons why decomposition operator D is defined to split a digraph A into 
its edges, generating as many digraphs as edges in A. 

If so desired, in order to consider isolated nodes, it is possible to define two decompo- 
sition operators, one for nodes and one for edges. Note however that decomposition for 
nodes makes sense mostly for graphs made up of isolated nodes, or for parts of graphs 



0^0'= ({A 1 



■ ■ , A"}, d i: j : A 1 — > A j ) 




(7.14) 
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consisting of isolated nodes only. In this case, we would be dealing with sets more than 
with graphs (the topology of a graph is defined by its edges). 

Definition 7.2.3 (Decomposition) Given an AC = (D, f) with ground formula f = 
3A[Q(A)], diagram 0 = {A} and host graph G, T> acts on AC - D (AC) = AC = (X)', f) 
- in the following way: 



*'= ({A\...,A"},d, 



A j ) 



f i— > f = 3A 1 . . . 3A™ 



\/A* 



(7.15) 



where n = #{edg(A)}, the number of edges of A. So A 1 a A, containing a single edge of 
digraph A. 

In words: Demanding a partial morphism is equivalent to ask for the existence of a 
total morphism of some of its edges, i.e. each A % contains one and only one of the edges of 
A. It does not seem to be relevant whether A % includes all nodes of A or just the source 
and target nodes. 




Fig. 7.9. Closure and Decomposition 



Example. nWe will consider conditions represented in Fig. 7.9, Aq for closure and A\ 
for decomposition, to illustrate definitions 7.2.2 and 7.2.3. 

Recall that the formula associated to closure is f = VALA]. Closure applied to Aq 
outputs two digraphs, Aq and Aq, and a morphism d\ 2 that identifies nodes 1 and 3. Any 
further match of Aq in G would imply an isomorphism. Equation (7.14) for Aq is 
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f = 3^EL4* [A* a ^ 

with associated diagram 

X>'= ({Al,A 2 0 },d° 12 :Al^Al) (7.17) 

depicted on the center of Fig. 7.9. Note that the maximum number of non-empty partial 
morphisms not being isomorphisms is 2. 

Formula associated to 2) is f = 3A[Q(A,G)]. Decomposition can be found on the 
right of the same figure, in this case with associated formulas: 

X>' =({AlA\},d\ 2 :A\^A\) 

f = 3A\3Al [A\ v A\] . (7.18) 

The number of edges that make up the graph is 2, which is the number of different 
graphs A\M 

Now we get to the main result of this section. The following theorem states that it 
is possible to reduce any formula in an application condition to one using existential 
quantifiers and total morphisms. Recall that, in Matrix Graph Grammars, matches are 
total morphisms. 8 

Theorem 7.2.4 Let GC = (d, f) be a graph constraint such that f = f (P, Q) is a ground 
junction. Then, f can be transformed into f = f'(P) where only existential quantifiers are 
used. 

Proof 

□Define the depth of a graph for a fixed node no to be the maximum over the shortest path 
(to avoid cycles) starting in any node different from no and ending in no. The diagram d 
is a graph 9 with a special node G. We will use the notation depth (GC) = depth (5), the 
depth of the diagram. 

In order to prove the theorem we apply induction on the depth, checking out every 
case. There are sixteen possibilities for depth (0) = 1 and a single element A, summarized 
in table 7.1. 

8 In fact in any approach to graph transformation, to the best of our knowledge. 

9 Where nodes are digraphs Ai and edges are morphisms dij. 



(7.16) 
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(1) 3A[A] 


(5) U[A] 


(9) 3A[Q(A)] 


(13) U[Q(A)] 


(2) 3A[A] 


(6) U[A] 


(10) 3A[Q(A)] 


(14) VA[Q(A)] 


(3) $A[A] 


(7) W1L4] 


(11) tA[Q{A)} 


(15) VA[Q(^L)] 


(4) $A[A] 


(8) VA[A] 


(12) M[Q(^)] 


(16) VA[Q(A)] 



Table 7.1. All Possible Diagrams for a Single Element 



Elements in the same row for each pair of columns are related using equalities = 
VAL4] and Y-A[.A] = 3 -A [A], so it is possible to reduce the study to cases (l)-(4) and 
(9h(12). 10 

Identities Q(A) = P{A,G) and Q(A) = P(A,G) (see also equation (7.6)) reduce 
(9)-(12) to formulas (l)-(4): 

3A[Q(A)] = 3A [P(A,G)] 
3A[Q(A)]=3A \P(A,G)} 
U[Q(A)] = U {P(A,G)] 
U[Q(A)] = U [P(A,G)]. 

What we mean with this is that it is enough to study the first four cases, although it 
will be necessary to specify if A must be found in G or in G. Finally, every case in the 
first column can be reduced to (1): 

• (1) is the definition of match in Sec. 5.1. 

• (2) can be transformed into total morphisms (case 1) using operator J): 



3A p] = 3A [Q(A, G)] = 3A 1 . . . 3A n 



n 



\JP(A\G) 



(3) can be transformed into total morphisms (case 1) using operator €: 



U \A] = VA[A] =3A 1 ... 3A r - 



(7.19) 



(7.20) 



• (4) combines (2) and (3), where operators £ and T> are applied in order DoC (see 
remark below): 



10 Notice that y should be read "not for all. . . " and not "there isn't any. . . ". 
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$A[A] =VA\A\= 3A 11 . . . 3A r ' 



z=i j=i 



(7.21) 



If there is more than one element at depth 1, this same procedure can be applied 
mechanically. Note that if depth is 1, graphs on the diagram are unrelated (otherwise, 
depth > 1). Well-definedness guarantees independence with respect to the order in which 
elements are selected. 

For the induction step, when there is a universal quantifier VA, according to (7.14), 
elements of A are replicated as many times as instances of this graph can be found in the 
host graph. Suppose the connected graph is called B. There are two possibilities: Either 
B is existentially VA3B or universally MAiB quantified. 

If B is existentially quantified then it is replicated as many times as A. There is no 
problem as morphisms dij : Bi — > Bj can be isomorphisms. 11 Mind the importance of 
the order: VA3B # 3BMA. 

If B is universally quantified, again it is replicated as many times as A. Afterwards, 
B itself needs be replicated due to its universality. Note that the order in which these 
replications are performed is not relevant, MAiB = V-BVAB 
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Fig. 7.10. Application Condition Example 

11 If for example there are three instances of A in the host graph but only one of B, then the 
three replicas of B are matched to the same part of G. 
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Examples. nLet be given a diagram like the one in Fig. 7.10 with formula f = 
B-A1V.A23.A3 [A 2 => (Ai a A3)]. Say C stands for conveyor. 12 If a conveyor is connected 
to three conveyors, then they are eventually joint into a single conveyor. Graph G in the 
same figure satisfies the application condition as elements (2 : C), (4 : C) and (5 : C) are 
connected to a single node (3 : C). Graph G' does not satisfy the application condition. 

Note that: 

f = 3A 1 VA 2 3A 3 [Ai => (A 1 a A 3 )] = 3A 1 VA 2 3A 3 \M v (A x a A 3 )] . (7.22) 

Suppose that the second form of f in (7.22) is used. Closure applies to A 2 , so it 
is copied three times with the additional property of mandatorily being identified in 
different parts of the host graph. As A 3 is connected to A 2 it is also replicated. A\ has 
no common element with A 2 so it needs not be replicated. Hence, a single A\ appears 
when the closure operator is applied. Note however that there is no difference if A\ is 
also replicated because all different copies can be identified in the same part of the host 
graph. 

The key point is that A 2 must be matched in different places of the host graph 
(otherwise there should be some isomorphism) and the same may apply to A 3 (as long as 
node (4 : C) in A3 is different for A 3l A' 3 and A") but A\, A[ and A" can be matched in 
the same place. Here there is no difference in asking for three matches of A\ or a single 
match, as long as they can be matched in the same place. A\, A[ and A" are depicted 
on the right of Fig. 7.11. 

In terms of P and Pu, P\j is applied to A 2 and P to A\ and A 3 . ■ 
Theorem 7.2.4 is of interest because derivations as defined in Matrix Graph Grammars 
(the matching part) use only total morphisms and existential quatifiers. An application 
condition AC = (Vac^ac) is a graph constraint GC = (Qacjac) with 13 

Uc = [LaP (N, G) a f GC ] , (7.23) 

so theorem 7.2.4 can be applied to application conditions. 

12 Taken from the study case in App. A. 

13 Actually, it is not necessary to demand the existence of the nodes of N because they are the 
same as those of L. 
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Fig. 7.11. Closure Example 



Corollary 7.2.5 Any application condition AC = (0, f) such that f = f (P, Q) is a ground 
function can be embedded into its corresponding direct derivation. 

This corollary asserts that any application condition can be expressed in terms of 
Matrix Graph Grammars rules. So we have proved the informal equations MGG + AC 
= MGG + GC = MGG. Examples illustrating formulas (7.19), (7.20) and (7.21) and 
previous corollary can be found in Sec. 7.3. 

Remark. nit is not difficult to see that £ and D commute, i.e. €. o J) = D o £. In fact 
in equation (7.21) it does not matter whether D o £ or D o £ is considered. 

Composition J) o £ is a direct translation of VA[^4] which, in first instance, considers 
all appearances of nodes in A and then splits these occurrences into separate digraphs. 
This is the same as considering every pair of single nodes connected in A by one edge 
and take their closure, i.e. £ oS.I 



7.3 Functional Representation 

In this section, operators £ and J) are translated into the functional notation of previous 
sections (see Sec. 2.5 for a quick introduction), inspired by the Dirac or bra-ket notation, 
where productions can be written as R = (L,p). This notation is very convenient for 
several reasons, for example, it splits the static part (initial state, L) from the dynamics 
(element addition and deletion, p). Besides, this will permit us to interpret application 
conditions as sequences or sets of sequences and to study their consistency through 
applicability. 
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Operators £ and 55 will be formally represented as f and T, respectively. Recall that 
T has been used in the demonstration of proposition 6.3.3. 

Let p : L — » R be a production with application condition AC = (£), f). We will follow 
a case by case study of the demonstration of theorem 7.2.4 to structure this section. 

The first case addressed in the proof of theorem 7.2.4 is the most simple: If the nodes 
of A are found in G then its edges must also be matched. 

d = (A,d: L ^ A), f = 3A[A]. (7.24) 

Let id,A be the production that does nothing on A. The set of identities 

(L v A,p} = (L, icLa oj)) = po id,A (7.25) 

proves that 

idX(L) = L v A, (7.26) 

which is the adjoint operator of idA- Here, or is carried out according to identifications 
specified by d. Production idA can be seen as an operator (adjoints are defined only for 
operators). As a matter of fact, it is easy to prove that any production is in particular 
an operator. 14 

So if AC asks for the existence of a graph like in (7.24), it is possible to enlarge the 
production p^po idA- The marking operator T M (Sec. 5.2) enables us to use concate- 
nation instead of composition as in equation (7.25): 

(Lv A,p) = p;id A . (7.27) 

The following lemma has just been proved: 

Lemma 7.3.1 (Match) Let p : L — » R be a production together with an application 
condition as in (7.24)- Its applicability is equivalent to the applicability of the sequence 
p;idA, as in equation (7.27). 

Example. nOn the left of Fig. 7.12 a production and the diagram of its weak appli- 
cation condition is depicted. Let its formula be 3A[A]. On the right, its transformation 
according to (7.27) is represented, but using composition instead of concatenation. ■ 

14 Just define its action. 
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Fig. 7.12. Production Transformation According to Lemma 7.3.1 



We will introduce a kind of conjugate of production idA, to be written idA- On the 
left of Fig. 7.13 there is a representation of idA- It simply preserves (uses but does not 
delete) all elements of A, which is equivalent to demand their existence. On the right we 
have its conjugate, idA, which asks for nothing to the host graph except the existence of 
A in the complement of G. 



A 



a. a 



->■ A 



id A 



G E 



v 

>-G 



G E G 

Fig. 7.13. Identity idA and Conjugate idA for Edges 



V 



If instead of introducing idA directly, a definition on the basis of already known 
concepts is preferred we may proceed as follows. Recall that N = r v eD^ so our only 
chance to define idA is to act on the elements that some production adds. Let 

P e ;p r (7.28) 

be a sequence such that the first production (p r ) adds elements whose presence is to be 
avoided and the second (p e ) deletes them (see Fig. 7.14). The overall effect is the identity 
(no effect) but the sequence can be applied if and only if elements of A are in G E . 

Note that a similar construction does not work for nodes because if a node is already 
present in the host graph, a new one can be added without any problem (adding and 
deleting a node does not guarantee that the node is not in the host graph). 
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The way to proceed is to care only about nodes that are present in the host graph 
as the others, together with their edges, will be present in the completion of the comple- 
ment of G. This is represented by Ajj, where R stands for restriction. Restriction and 
completion are in some sense complementary operations. 




G > H 

Fig. 7.14. id a as Sequence for Edges 



Our analysis continues with the second case in the proof of theorem 7.2.4, which 
states that some edges of A can not be found in G for some identification of nodes in G, 
i.e. yA [A] = 3A p]. This corresponds to operator Ta (decomposition), defined by: 

f A (p) = { Pl ,...,p n }. (7.29) 

Here, pi = p o id^ with A 1 a graph consisting of one edge of A (together with its 
source and target nodes) and n = #{edg(A)}, the number of edges of A. Equivalcntly, 
the formula is transformed into: 



n 



(7.30) 



f = 3A[A] i — > f = 3A 1 . . . 3A" 

=i 

i.e. the matrix of edges that must not appear in order to apply the production is enlarged 
Ni = N v A 1 (being Ni the nihilation matrix of pi). 

If composition is chosen, the grammar is modified by removing rule p and adding the 
set of productions {p\, . . . If the production is part of the sequence q2',P] qi then we 
arc allowing variability on production p as it can be substituted by any pi, i e {1, . . . , n}, 
i.e. q 2 ;p;qi i-» qi\Pi\q\- 

A similar reasoning applies if we use concatenation instead of composition but it is 
not necessary to eliminate production p from the grammar: qi\p\q\ i-» q2\p;idAi;q\. 
Production p and sequence idj^i are related through marking. 
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Lemma 7.3.2 (Decomposition) With notation as above, letp : L — > R be a production 
together with an application condition as in (7.30). Its applicability is equivalent to the 
applicability of any of the sequences 

s l =p;id~ i (7-31) 
where A 1 is defined as in equations (7.19) or (7.30). 

Before moving on to the third case in the proof of theorem 7.2.4, previous results will 
be clarified with a simple example with similar conditions to those of Fig. 7.9. 




Fig. 7.15. Decomposition Operator 



Example. ^Consider production p to the left of Fig. 7.15 and application condition 
A on the center of the same figure. If the associated formula for A is f = 3A [A] then 
three sequences are derived (pi,i e {1,2,3}) with pi = p]id^, being A 1 those depicted 
to the right of Fig. 7.15. ■ 

The third case in the proof of theorem 7.2.4 demands that for any identification of 
nodes in the host graph every edge must also be found. Recall that = MA [A] 

which is associated to operator X4 (closure). Hence, 



f = $A[A] 1— > 3A 1 . . . 3A n 



(7.32) 



This means that more edges must be present in order to apply the production, L 
Vr=i {L v By a similar reasoning to that in (7.26): 




A 1 v L 




L,T A (p) 



p o id^ o 



(7.33) 



the adjoint operator can be calculated: 
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fX(L)=Lv\\/Aij. (7.34) 

As commented above, the marking operator allows us to substitute composition 
with concatenation: 

/\/ (A 1 v , = p; id-gi ; • • ■ ; id^z = p; id A , (7.35) 

where id A = id 21 ° • • • 0 id^. We have proved the following lemma: 

Lemma 7.3.3 (Closure) FFif/i notation as above, let p : L — > R be a production to- 
gether with an application condition as in (7.32). Its applicability is equivalent to the 
applicability of the sequence p; id jy. 




Fig. 7.16. Closure Operator 



Example. ^Consider production p to the left of Fig. 7.16 and application condition 
A on the center of the same figure. If the associated formula for A is f = VA [A] then 
two sequences are derived {pi,i 6 {1, 2}) with pi = p; id^ , being A 1 those depicted to the 
right of Fig. 7.16.B 

The fourth case is equivalent to that known in the literature as negative application 
condition, NAC, which is a mixture of (2) and (3), in which order of composition does 
not matter due to the fact that T and T commute. 15 It says that there does not exist an 
idcntificaction of nodes of A for which all edges in A can also be found, ^j4[A], i.e. for 
every identification of nodes there is at least one edge in G. If we define 

f A (p) = (f A o f A ) (p) = (f A o £4) (p), (7.36) 

then 

15 See remark by the end of Sec. 7.2, on p. 181. 
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AV- 1 ' 



(7.37) 



f = VA[A] i — > 3A 11 . . . 3A mn 

L»=ij=i 

In more detail, if we first apply closure to A then we obtain a sequence of m + 1 
productions, p i — > p; id^; . . . ; id-^, assuming m different matches of A in the host 
graph G. Right afterwards, decomposition splits every A 1 into its components (in this 
case there are n edges in A) . So every match of A in G is transformed to look for at least 
one missing edge, id^ \ — > id-^ v . . . v idjj^. 

Operator Ta acting on a production p with a weak precondition A results in a set of 
productions 

Ta (p) = {pi, ■ ■ ■ ,Pr} 

where r = to™. Each pk is the composition of m + 1 productions, defined as pk = 
P°T'd^^ 0 o . . -oid A ^~^ m . Marking operator T M of Sec. 5.2 permits concatenation instead 
of composition: 

f A {p) = {Pk \Pk =p;S^;...;id jl5 j 5 ,} te{liii)m , | . (7.38) 

Lemma 7.3.4 (Negative Application Conditions) Keeping notation as above, let 
p : L — > R be a production together with an application condition as in (7.37), then 
its applicability is equivalent to the applicability of some of the sequences derived from 
equation (7.38). 

Example. nlf there are two matches and A has three edges, i = 3 and j = 2, then 
equation (7.37) becomes: 

3 2 _ _ _ 

/W/AV = (A 11 v A 12 ) (A 21 v A 22 ) (U 31 v A 32 ) 

i=l j = l 

= A^A^A 3 " 1 v A^A^A 32 v ... v A^A^A* 1 v I* 2 1 22 1 32 . 

For example, the first monomial A 11 A 21 A 31 is the sequence 

Pi ^^tt! ^d-^2i \ id^pjj.B 

Summarizing in a sort of rule of thumb, there are two operations - and and or - that 
might be combined using the rules of monadic second order logics. These operations are 
transformed in the following way: 
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• Operation and in the f of an application condition becomes an or when calculating 
an equivalent production. 

• Operation or enlarges the grammar with new productions, removing the original rule 
if composition instead of concatenation is chosen. 




Fig. 7.17. Example of Diagram with Two Graphs 



Example. nLet AC = (c),f) be a graph constraint with diagram 5 depicted in Fig. 
7.17 (graphs shown in Fig. 7.18) and associated formula f = 3L\/Aq3Ai [L (Aq => A\)], 
dw ({!}) = {!}■ Let morphisms be defined as follows: d^\ ({1}) = {1}, dio ({1}) = {1} 
and d w ({2}) = {2}. 

The interpretation of f is that L must be found in G (for simplicity iV is omitted) 
and whenever nodes of Aq are found then there must exist a match for the nodes of A\ 
such that there is an edge joining both nodes. 

Note that matching of nodes of Aq and A\ must coincide (this is what d\Q is for) and 
that node 1 has to be the same as that matched by tul for L in G (morphisms duy and 
dm). 

Application of operator T for the universal quantifier yields six digraphs for Aq and an- 
other six for Ai, represented in Fig. 7.18. Note that in this case we have A 0 = P E (Ag, G) 
because A 0 has only one edge. Suppose that mi({l,2,3}) = {1",2',3}, then f becomes 

fx = 3L3A^A 5 0 3ApAl [l (if v Af) (if v A?)] . (7.39) 

Different matches and relations among components of the application condition derive 
different formulas f. For example, we could fix only node 1 in dio, allowing node 2 to 
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Fig. 7.18. Precondition and Postcondition 



be differently matched in G. Notice that neither A\ nor A\ exist in G so the condition 
would not be fulfilled for Aq or A® because terms Aq v Aq and A\ v A\ would be false 
(Aq and Aq are in G for any identification of nodes). ■ 

Previous lemmas prove that weak preconditions can be reduced to studying sequences 
of productions. If instead of weak preconditions we have preconditions then we should 
study derivations (or sets of derivations) instead of sequences. 

Theorem 7.3.5 Any weak precondition can be reduced to the study of the corresponding 
set of sequences. 

Proof 

□This result is the sequential version of theorem 7.2.4. The four cases of its proof corre- 
spond to lemmas 7.3.1 through 7.3.4.B 

Example. □Continuing example on p. 188, equation (7.39) put in normal disjuntive 
form reads 

fx = ILBA^A^AfBAl [lA^AJ v LA%A\ v LA(Aj v LA\A\^ (7.40) 
which is equivalent to 

fa = 3L3A^3ApAf3Al [LA\A\] 
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because Aq and Aq can be found in G. This is the same as applying the sequence 
p;id^i;id^5 or p; id^s ; id^t (because id a\ Lid ^45). 

So fulfillment of AC, once match m_L has been fixed, 16 is equivalent to the applicability 
of the sequence to which equation (7.40) gives raise. ■ 

One application of this theorem is to test if a weak precondition specifies a tautology 
or a fallacy by using the theory developed so far for sequences. It will also be used 
in the next section to study how to construct weak postconditions equivalent to given 
weak preconditions. It is also useful to proceed in the opposite way, i.e. to transform 
postconditions into equivalent preconditions. 

Corollary 7.3.6 A weak precondition is coherent if and only if its associated sequence 
(set of sequences) is coherent. Also, it is compatible if and only if its sequence (set of 
sequences) is compatible and it is consistent if and only if its sequence (set of sequences) 
is applicable. 

Example. nFor coherence wc will change the formula of previous examples a little. 
Consider $2 = 3_LV.Ao3.Ai [L (A_i => A_o)]. Note that f2 cannot be fulfilled because on the 
one hand edges (1,1) and (1,2) must be found in G and on the other edge (1,1) must 
be in G. 

To simplify the example, suppose that some match is already given. The sequence to 
study is p; idA^idAo, which is not coherent because in its equivalent form p; idA^P^Pa 
production p$ deletes edge (1,1) used by zd^.B 

Corollary 7.3.7 A weak precondition is consistent if and only if it is coherent and com- 
patible. 

Remark. ...Something left undefined is the order of productions id a and id a in the 
sequence. Consistency does not depend on the ordering of productions as long as the 
last to be applied is production p. Either there is sequential independence or, if not, any 
sequence detects the inconsistency. ■ 

16 In this example. In general it is not necessary to fix the match in advance. 
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7.4 Moving Conditions 

Roughly speaking, there have been two basic ideas in previous sections that allowed us 
to check consistency of the definition of direct derivations with weak preconditions, and 
also provided us with some means to use the theory developed so far in order to continue 
the study of application conditions: 

• Embed application conditions into the production or derivation. The left hand side L 
of a production receives elements that must be found - P(A, G) - and N those whose 
presence is forbidden - P(A, G) -. 

• Find a sequence or set of sequences whose behaviour is equivalent to that of the 
production plus the application condition. 

In this section we will care about how (weak) preconditions can be transformed into 
(weak) postconditions and viceversa: Given a weak precondition A, what is the equivalent 
weak postcondition (if any) and how can one be transformed into the other?. Before this, 
it is necessary to state the main results of previous sections for postconditions. 

The notation needs to be further enlarged so we will append a left arrow on top 
of conditions to indicate that they are (weak) preconditions and an upper right arrow 
for (weak) postconditions. Examples are A for a weak precondition and A for a weak 
postcondition. If it is clear from context, we will omit arrows. 

There is a direct translation of theorem 7.2.4 for postconditions. Operators T-* and 

are defined similarly for weak postconditions. Again, if it is clear from context, it will 
not be necessary to overelaborate the notation. 

Equivalent results to lemmas in Sec. 7.3, in particular to equations (7.27), (7.31), 
(7.35) and (7.38) are given in the following proposition: 
— > 

Proposition 7.4.1 Let A= (f, 5) = (f, ({A},d : R — > A)) be a weak postcondition. Then 
we can obtain a set of equivalent sequences to given basic formulae as follows: 

(Match) f = 3A[A] .— » T A (p)=id A ;p. (7.41) 

(Decomposition) f = 3A[A] i — > Ta (p) = id A ',P- (7.42) 

(Closure) f = $A[A] i — > f A (p) = id A i;. . . ; id A ™;p. (7.43) 

(NAC) f = $A[A] .— » f A (p) = Jd A ^o ; . . .;Td A ^ m ;p. (7.44) 
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Proof 
□ ■ 

There is a symmetric result to theorem 7.3.5 for weak postconditions that directly 
stems from proposition 7.4.1. The development and ideas are the same, so we will not 
repeat them here. 

Theorem 7.4.2 Any weak postcondition can be reduced to the study of the corresponding 
set of sequences. 

Proof 
□■ 

Corollaries 7.3.6 and 7.3.7 have their versions for postconditions which are explicitely 
stated for further reference. 

Corollary 7.4.3 A weak postcondition is coherent if and only if its associated sequence 
(set of sequences) is coherent. Also, it is compatible if and only if its sequence (set of 
sequences) is compatible and it is consistent if and only if its sequence (set of sequences) 
is applicable. 

Corollary 7.4.4 A weak prostcondition is consistent if and only if it is coherent and 
compatible. 

Let p : L — > R be a production applied to graph G such that p(G) = H. Elements 
to be found in G are those that appear in L. Similarly, elements that are mandatory in 
the "post" side are those in R. The evolution of the positive part (to be added to L) of 
a weak application condition is given by the grammar rule itself. 

The evolution of the negative part N has not been addressed up to now as it has not 
been needed. Let's represent by Nl the negative elements of the LHS of the production 
and by Nr those elements that must not be present in the RHS. 

Proposition 7.4.5 Let p : L — > R be a compatible production with negative left hand 
side Nl and negative right hand side Nr. Then, 



N R = p- 1 (N L ) . 



(7.45) 
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Proof 

□First suppose that Nl is the one naturally defined by the production, i.e. the one found 
in lemma 4.4.5. The only elements that should not appear in the RHS are potential 
dangling edges and those deleted by the production: e v D. It coincides with (7.45) as 
shown by the following set of identities: 

P~ l (Nl) = e v r Nl = e v r (r v eD) = e v erD = e v r D = e v D. (7.46) 

In the last equality of (7.46) compatibility has been used, rD = D. Now suppose that 
Nl has been modified, adding some elements that should not be found in the host graph 
(theorem 7.3.5). There are three possibilities: 

• The element is erased by the production. This case is ruled out by corollary 7.3.6 as 
the weak precondition could not be coherent. 

• The element is added by the production. Then, in fact, the condition is superfluous 
as it is already considered in Nl without modifications, i.e. (7.46) can be applied. 

• None of the above. Then equation (7.45) is trivially fulfilled because the production 
does not affect this element. 

Just a single element has been considered to ease exposition. ■ 

Remark. nThough strange at a first glance, a dual behaviour of the negative part 
of a production with respect to the positive part should be expected. The fact that Nl 
uses p~ x rather than p for its evolution is quite natural. When a production p erases one 
element, it asks its LHS to include it, so it demands its presence. The opposite happens 
when p adds some element. For N things happen quite in the opposite direction. If the 
production asks for the addition of some element, then the size of Nl is increased while 
if some element is deleted, Nl shrinksM 

Now we can proceed to prove that it is possible to transform preconditions into 
postconditions and back again. Proposition 7.4.5 allows us to consider the positive part 
only. The negative part would follow using the inverse of the productions. 

There is a restricted case that can be directly addressed using equations (7.41) - 
(7.44), theorems 7.3.5 and 7.4.2 and corollaries 7.3.6 and 7.4.3. It is the case in which the 
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transformed postcondition for a given precondition does not change. 17 The question of 
whether it is always possible to transform a precondition into a postcondition - and back 
again - would be equivalent to asking for sequential independence of the production and 
identities, i.e. whether id^i _L p or not. 

In general the production may act on elements that appear on the definition of the 
graphs of the precondition. Recall that one demand on precondition specification is that 
L and N are always the domain of their respective morphisms and djv (refer to 
comments on p. 168). The reason for doing so will be clarified shortly. 




Fig. 7.19. (Weak) Precondition to (Weak) Postcondition Transformation 



Theorems on this and previous sections make it possible to interpret preconditions 
and postconditions as sequences. The only difference is that preconditions are represented 
by productions to be applied before p while postconditions need to be applied after p. 
Hence, the only thing we have to do to transform a precondition into a postcondition (or 
viceversa) is to pass productions from one part to the other. The case in which we have 
sequential independence has been studied above. If there is no sequential independence 
the transformation can be reduced to a pushout construction 18 - as for direct derivation 

17 Note that this is not so unrealistic. For example, if the production preserves all elements 
appearing in the precondition. 

18 The square made up of L, R, A and A is a pushout where p, L, dL, R and A are known and 
A, Pa and di need to be calculated. Recall from Sec. 5.1 that production composition can 
be used instead of pushout constructions. The same applies here, but we will not enter this 
topic for now. 
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definition - except for one detail: In direct derivations matches are total morphisms while 
here dj, and djy need not be (see Fig. 7.19). 

The way to proceed is to restrict to the part in which the morphisms are defined (they 
are trivially total in that part). For example, the transformation for the weak application 
condition depicted to the left of Fig. 7.20 is a pushout. It is again represented to the 
right of the same figure. 




Fig. 7.20. Restriction to Common Parts: Total Morphism 



The notation is extended to represent this transformation of preconditions into post- 
conditions as follows: 

A= p (I) ■ (7.47) 

To see that precondition fulfillment is equivalent to postcondition fulfillment all we 
have to do is to use their representation as sequences of productions (theorems 7.3.5 
and 7.4.2). Note that applying p delays the application of the result (the id a or id a 
productions) in the sequence, i.e. we have a kind of sequential independence except that 
productions can be different (id<- ^ id->) because they may be modified by p: 

p;id^ i— » id 2 ;p. (7.48) 

If the weak precondition is consistent so must the weak postcondition be. There can 
not be any compatibility issue and coherence is maintained (again, idA and idA may be 
modified by the production). Production p deals with the positive part of the precondition 
and, by proposition 7.4.5, p~ x will manage the part associated to N. For the post-to-pre 
transformation roles of p and p~ l are interchanged. 
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Pre-to-post or post-to-pre transformations do not affect the shape of the formula 
associated to a diagram except in the case where redundant graphs are discarded. There 
are two clear examples of this: 

• The application condition requires the graph to appear and the production deletes 
all its elements. 

• The application condition requires the graph not to appear and the production adds 
all its elements. 

Recalling that there can not be any compatibility nor coherence problem due to 
precondition consistency, consistency permits the transformation, proving the main result 
of this section: 

Theorem 7.4.6 Any consistent (weak) precondition is equivalent to some consistent 
(weak) postcondition and viceversa. 

Proof 



This result allows us to extend the notation to consider the transformation of a 
precondition. A postcondition is the image of some precondition, and viceversa: 



As commented above, for a given application condition AC it is not necessarily true 
that A = p^ 1 ; p(A) because some new elements may be added and some obsolete elements 
can be discarded. What we will get is an equivalent condition adapted to p that holds 
whenever A holds and fails to be true whenever A is false. 

Example, nln Fig. 7.21 there is a very simple transformation of a precondition into 
a postcondition, through morphism p(A). The production deletes one arrow and adds a 
new one. The overall effect is reverting the direction of the edge between nodes 1 and 2. 

The opposite transformation, from postcondition to precondition, can be obtained by 
reverting the arrow, i.e. through p~ 1 (A). More general schemes can be studied apply- 
ing the same principles, although diagrams will be a bit cumbersome with only a few 
application conditions. 




(7.49) 
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Fig. 7.21. Precondition to Postcondition Example 



Let A = p 1 op yAj- If a pre-post-pre transformation is carried out, we will have 

A¥= A because edge (2,1) would be added to A- However, it is true that A = p^ 1 op [A). 

Note that in fact id<^-Lp if we limit ourselves to edges, so it would be possible to simply 
move the precondition to a postcondition as it is. Nonetheless, we have to consider nodes 
1 and 2 as the common parts between L and A- This is the same kind of restriction than 
the one illustrated in Fig. 7.20.B 

If the pre-post-pre transformation is thought as an operator T p acting on application 
conditions, then it fulfills 

T p 2 = id, (7.50) 

where id is the identity. The same would also be true for a post-pre-post transformation. 

Theorem 7.4.6 can be generalized at least in two ways. We will just sketch how to 
proceed as it is not difficult with the theory developed so far. 

Firstly, an application condition has been transformed into an equivalent sequence 
of productions (or set of sequences) but no e-productions have been introduced to help 
with compatibility of the application condition. Think of a production that deletes one 
node and that some graph of the application condition has an edge incident to that node 
(and that edge is not deleted by the production). So to speak, we have a DPO-like pre 
to post transformation theorem. It should not be very difficult to proceed as in Chap. 5 
to define a SPO-like behaviour. 
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Secondly, application conditions can now be thought as properties of the production, 
and not necessarily as part of its left or right hand sides. It is not difficult to see that, for 
a given sequence of productions, application conditions arc to some extent delocalized in 
the sequence. In particular it would be possible to pass conditions from one production 
to others inside a sequence (paying due attention to compatibility and coherence). Note 
that a postcondition for p\ in the sequence P2 ; Pi might be translated into a precondition 
for P2, and viceversa. 19 

When defining diagrams some "practical problems" may turn up. For example, if the 
diagram d = ( L -^>° Aq <^ A\ J is considered then there are two potential problems: 



1. The direction in the arrow A$ <— A\ is not the natural one. Nevertheless, injectiveness 
allows us to safely revert the arrow, dot = ^io ■ 

2. Even though we only formally state cIlo and d±o, other morphisms naturally appear 
and need to be checked out, e.g. d^i : R — » A\. New morphisms should be considered 
if they relate at least one element. 20 

Also, a possible interpretation of (7.50) is that the definition of the application con- 
dition can vary from the natural one, according to the production under consideration. 
Pre-post-pre or post-pre-post transformations adjust application conditions to the cor- 
responding production. 

Let's end this section relating graph constraints and moving conditions. Recall equa- 
tion (7.23), in which a first relationship between application conditions and graph con- 
straints is established. That equation states how to enlarge the requirements already 
imposed by a graph constraint to a given host graph if, besides, a given production is to 
be applied. 

Another different though related point is how to make productions respect some 
properties of a graph. This topic is addressed in the literature, for example in [22]. The 

19 This tranformation can be carried out under appropriate circumstances, but we are not limited 
to sequential independence. Recall that productions specifying constraints can be advanced 
or delayed even though they are not sequential independent with respect to the productions 

that define the sequence. 

20 Otherwise stated: Any condition made up of n graphs Ai can be identified as the complete 

graph K n , in which nodes are Ai and morphisms are dij . Whether this is a directed graph or 
not is a matter of taste (morphisms are injective) . 
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proposed way to proceed is to transform a graph constraint into a postcondition first and 
a precondition right afterwards. The equivalent condition to (7.23) would be 

f PC = 3R3N r [RaP(N r ,G) a f GC ] (7.51) 

being fee the graph constraint to be kept by the production. 

7.5 From Simple Digraphs to Multidigraphs 

In this section we show how it is possible to consider multidigraphs (directed graphs 
allowing multiple parallel edges) without changing the theory developed so far. At first 
sight this might seem a hard task as Matrix Graph Grammars heavily depend on ad- 
jacency matrices. Adjacency matrices are well suited for simple digraphs but can not 
deal with parallel edges. This section is a theoretical application of graph constraints and 
application conditions to Matrix Graph Grammars. 

Before addressing multidigraphs, variable nodes are introduced as one depends on the 
other. We will follow reference [34] to which the reader is referred for further details. 

If instead of nodes of fixed type variable types are allowed, we get a so called graph 
pattern. A rule scheme is just a production in which graphs are graph patterns. A substitu- 
tion function l specifies how variable names taking place in a production are substituted. 
A rule scheme p is instantiated via substitution functions producing a particular produc- 
tion. For example, for substitution function l we get p L . The set of production instances 
for p is defined as the set I(p) = {p L \ i is a substitution}. 

The kernel of a graph G, fcer(G), is defined as the graph resulting when all variable 
nodes are removed. It might be the case that ker(G) = 0. 

The basic idea is to reduce any rule scheme to a set of rule instances. Note that it 
is not possible in general to generate T(p) because this set can be infinite. The way to 
proceed is simple: 

1. Find a match for the kernel of L. 

2. Induce a substitution i such that the match for the kernel becomes a full match 
m:L L ^G. 

v L 

3. Construct the instance R L and apply p L to get the direct derivation G => H. 
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Mind the non-determinism of step (2) , which is matching. Rule schemes are required 
to satisfy two conditions: 

1. Any variable name occurs at most once in L. 

2. Rule schemes do not add variable nodes. 

These two conditions greatly simplify rule application when there are variable nodes, 
specially for the DPO approach. In our case they are not that important because, among 
other things, matches in Matrix Graph Grammars are injective. 

Let's start with multidigraphs and how it is possible to extend Matrix Graph Gram- 
mars to cope with them without any major modification. The idea is not difficult: A 
special kind of node (call it multinode) associated to every edge in the graph is intro- 
duced. Graphically, they will be represented by a filled square. 

Now two or more edges can join the same nodes, as in fact there are multinodes in 
the middle that convert them into simple digraphs. The term multinode is just a means 
to distinguish them from the rest of "normal" nodes that we will call simple nodes and 
will be represented as usual with colored circles. They are not of a different kind as for 
example hyperedges with respect to edges (see Sec. 3.4). In our case, simple nodes and 
multinodes are defined similarly and obey the same rules, although their semantics differ. 

There are some restrictions to be imposed on the actions that can be performed on 
multinodes (application conditions) or, more precisely, the shape or topology of permitted 
graphs (graph constraints). 

Operations previously specified on edges now act on multinodes. Edges are managed 
through multinodes: Adding an edge is transformed into a multinode addition and edge 
deletion becomes multinode deletion. Still, there are edges in the "old" sense, to link 
multinodes to their source and target simple nodes. We will touch on e-productions later 
in this section. 
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Fig. 7.22. Multidigraph with Two Outgoing Edges 
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Example. □ Consider the simple production in Fig. 7.22 with two edges between nodes 
1 and 3. Multinodes are represented by square nodes while normal nodes are left un- 
changed. When p deletes an edge, p T deletes a multinode. Adjacency matrices for p T 
are: 
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0 0 0 111 

0 0 0 0 0 0 

0 0 0 0 0 0 

0 0 10 0 0 

0 0 10 0 0 
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Adjacency matrices are more sparse because simple nodes are not directly connected 
by edges anymore. Note that the number of edges must be even. 

In a real situation a development tool such as AToM 3 should take care of all these 
representation issues. A user would see what appears on the left of Fig. 7.22 and not 
what is depicted on the right of the same figure. From a representation point of view we 
can safely draw p instead of p T . In fact, according to theorem 7.5.1, it does not matter 
which one is used. 

Some restrictions on what a production can do to a multidigraph are necessary in 
order to obtain a multidigraph again. Think for example the case in which after applying 
some productions we get a graph in which there is an isolated multinode (which would 
stand for an edge with no source nor target nodes). 

The question is to find the properties that define one edge and impose them on 
multinodes as graph constraints. This way, multinodes will behave as edges. In the bullets 
that follow, graphs between brackets can be found in Fig. 7.23: 



One edge always connects two nodes (diagram 0i, digraphs Co and C\). 
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• Simple nodes can not be directly connected by one edge (D 0 and Eq). Now edges 
start in a simple node and end in a multinode or viceversa, linking simple nodes with 
multinodes but not simple nodes between them. 

• A multinode can not be directly connected to another multinode (D\ and E\). The 
contrary would mean that an edge in the simple digraph case is incident to another 
edge, which is not possible. 

• Edges always have a single simple node as source (£2) and a single simple node as 
target (E 3 ). 21 
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Fig. 7.23. Multidigraph Constraints 



The graph constraint consists of three parts: First diagram Di is closely related to 
compatibility of the multidigraph 22 and has associated formula: 

fi =\/X3C 0 3C 1 3A3B[XA(C 0 V-BCi)]. (7.52) 

21 This condition can be relaxed in case hyperedges were considered. See Sec. 3.4. 

22 Note that now there are "two levels" when talking about a graph. For example, if we say 

compatibility we may mean compatibility of the multidigraph (left side in Fig. 7.22) or of the 
underlying simple digraph (right side in Fig. 7.22) which are quite different. In the first case 



7.5 From Simple Digraphs to Multidigraphs 203 



Diagram D2 and formula 

fa = VD 0 VD 1 \DoDt] (7.53) 

prevents that a simple node or a multinode could be linked by an edge to itself. Self loops 
should be represented as in Cq. 

Finally, when considering two or more simple nodes or multinodes, configurations in 
diagram O3 are not allowed. Its associated formula is: 

f 3 = VE 0 VE 1 VE 2 \/E 3 [Q (E 0 )Q {Ex)E 2 E{\ . (7.54) 

This set of constraints will be known as multidigraph constrains, and the abbreviation 
MC = (0i u 02 u f 3, fi a f 2 a f 3 ) will be used. Please, refer to Fig. 7.23. 



s 4 



Fig. 7.24. Simplified Diagram for Multidigraph Constraint 

Some of these diagrams could be merged, also unifying (and simplifying a little bit) 
their corresponding formulas. For example, instead of Do, D\, Eq and E\ we could 
have considered the diagram in Fig. 7.24. Its associated formula would have been f4 = 
V-Fo [Q(Fo)]. However, a new constraint needs to consider the case in which a single 
simple node or a single multinode is found in the host graph (as these two cases are not 
taken into account by (o 4 ,f4)). 

Theorem 7.5.1 (Multidigraphs) Any multidigraph is isomorphic to some simple graph 
G together with multidigraph constraint MC = (f, U), with d as defined in Fig. 7.23 and 
f as m (7.52), (7.53) and (7.54). 

we talk about edges connecting nodes while in the second we speak of edges connecting some 
node with some multinode. 
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Proof (sketch) 

□A graph with multiple edges M = (V,E,s,t) consists of disjoint finite sets V of nodes 
and E of edges and source and target functions s : E — » V and t : E — > V, respectively. 
Function v = s(e), v e V, e e E returns the node source v for edge e. We are considering 
multidigraphs because the pair function (s,t) : E — > V x V need not be injective, i.e. 
several different edges may have the same source and target nodes. We have digraphs 
because there is a distinction between source and target nodes. This is the standard 
definition found in any textbook. 

It is clear that any M can be represented as a mutidigraph G satisfying MC. The 
converse also holds. To see it, just consider all possible combinations of two nodes and 
two multinodes and check that any problematic situation is ruled out by MCM 

The multidigraph constraint MC = (f , d) must be fulfilled by any host graph. If there 
is a production p : L — > R involved, MC has to be transformed into an application condi- 
tion over p. In fact, the multidigraph constraint should be demanded both as precondition 
and postcondition (recall that we can transform preconditions into postconditions and 
viceversa). In Sec. 7.1 we saw that this is an easy task in Matrix Graph Grammars: See 
equation (7.23) and also (7.51). This is a clear advantage of being able to relate graph 
constraints and application conditions. 

This section is closed analysing what behaviour we have for multidigraphs with respect 
to dangling edges. With the theory as developed so far, if a production specifies the 
deletion of a simple node then an ^-production would delete any edge incident to this 
simple node, connecting it to any surrounding multinode. But restrictions imposed by 
the multidigraph constraint do not allow this so any production with potential dangling 
edges can not be applied. Thus, we have a DPO-like behaviour with respect to dangling 
edges for multidigraphs. 

In order to have a SPO-like behaviour e-productions need to be restated, defining 
them at a multidigraph level, i.e. e-productions have to delete any potential "dangling 
multinode". A new type of productions (^-productions) are introduced to get rid of 
annoying edges 23 that would dangle when multinodes are also deleted by £-productions. 

23 Edges connect simple nodes and multinodes. 
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We will not develop it in detail and will limit to describe the concepts. The way to 
proceed is very similar to what has been studied in Sec. 5.1, by defining the appropriate 
operator T~ and redefining T e . 



1* 



1 2 
V., G 



O 

2 

H 



b 



^3 





• 0 • 




0 


b 








• 1 








1 h 2 








P. 






P 






na 1 na 2 








0 








0 




• 3 R 




a 2 

a 


L 




R 




L 




R 



T 



• 0 • 

1 b 2 

ma, /a, 



3 «3 G, 

Fig. 7.25. e-production and H-production 
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A production p : L — > R between multidigraphs that deletes one simple node may 
give raise to one e-production that deletes one or more multinodes. This e-production 
can in turn be applied only if any edge has already been erased, hence possibly provoking 
the appearance of one S-production. 

This process is depicted in Fig. 7.25 where, in order to apply production p, productions 
p e and ps need to be applied before. 

p — >p\p e ;p E (7.55) 

Eventually, one could simply compose the H-production with its e-production, renam- 
ing it to e-production and defining it as the way to deal with dangling edges in case of 
multiple edges, fully recovering a SPO-like behaviour. As commented above, a potential 
user of a development tool such as AToM 3 would still see things as in the simple digraph 
case, with no need to worry about ^-productions. 
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7.6 Summary and Conclusions 

In this chapter, graph constraints and aplication conditions have been introduced and 
studied in detail for the Matrix Graph Grammar approach. Our proposal considerably 
generalizes previous efforts for other approaches such as SPO or DPO. 

Generalization is not necessarily good in itself, but in our opinion it is interesting in 
this case. We have been able to "reduce" graph constraints and application conditions 
one to each other, which turned out to be an interesting property in Sec. 7.5. Besides, 
the left hand side, right hand side and nihilation matrices appear as particular cases of 
this more general framework, giving the impression of being a very natural extension of 
the theory. Also, it is always possible to embed application conditions in Matrix Graph 
Grammars direct derivations (theorem 7.2.4 and corollary 7.2.5). We have managed to 
study preconditions, postconditions and their weak counterparts, independently to some 
extent of any match. 

Other interesting points are that application conditions seem to be a good way to 
synthesize closely related grammar rules. Besides, they allow us to partially act on the 
nihilation matrix (recall that the nihilation matrix was directly derived out of L, e and 
r). 

Representing application conditions using the functional notation introduced for pro- 
ductions and direct derivations allowed us to prove a very useful fact: Any aplication 
condition is equivalent to some sequence of productions (or a set of them). Theorems 
7.3.5 and 7.4.2. 

It is worth stressing the importance of the relationship between application condi- 
tions and sequences of productions. This for example permitted the translation of the 
notion of consistency of application condition to applicability of a sequence - problem 1 - 
(corollaries 7.3.6 and 7.4.3). Hence, consistency of application conditions can be studied 
through any of the two characterizations given in theorem 5.5.1 (for example corollaries 
7.3.7 and 7.4.4). This is another point that makes us think that application conditions 
as introduced here are a natural extension of the theory. 

Preconditions and postconditions happen to be closely related and any consistent 
precondition can be translated into a consistent postcondition. In this way, restrictions 
- which at times are better thought as properties - are actually delocalized inside the 
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production (theorem 7.4.6). For future work, the fact that application conditions are 
delocalized along productions of a sequence and how to deal with SPO-like application 
conditions (more flexible than DPO-likc ones) will be further investigated. Just as ap- 
plicability helps with consistency of an application condition, sequential independence - 
problem 3 - applies to application condition derealization. It could then be possible to 
apply the corresponding theorems of Chap. 6 to study derealization. 

After variable nodes are introduced, it is not difficult to "generalize" the theory to 
deal with multidigraphs instead of simple digraphs without any major modification. 

The next chapter addresses one fundamental topic in grammars: Reachability. This 
topic has been stated as problem 4 and is widely addressed in the literature, specially 
in the theory of Petri nets. Chapter 9 closes this book with a general summary, some 
more conclusions and proposals for further research. Appendix A presents a worked out 
example to illustrate all the theory developed in this dissertation, focusing more on the 
practical side of the theory. 



8 



Reachability 



In this chapter we will brush over reachability, presented as problem 4 in Sec. 1.2. It is 
an important concept both from practice and theory. Given a grammar G recall that, 
for fixed initial Sq and final St states, reachability solves the question of whether it is 
possible to go from So to St with productions in G. It should be of capital importance to 
provide one or more sequences that carry this out, or identify that St is unreachable. At 
least, it should be very valuable to gather some information of what productions would 
be involved and their number. 

So far, this problem is easily related (in the sense that it depends on) to problem 1, 
applicability, because we look for a sequence applicable to Sq. Also problem 3 contributes 
because if it is not possible to give a concrete sequence but a set of productions (the order 
is unkonwn) together with the number of times that production appears in the sequence, 
problem 3 may reduce the size of the search space (to find out one concrete sequence 
that transforms So into St)- 

The chapter is organized as follows. Section 8.1 introduces Petri nets and explains 
why the state equation is a necessary but not a sufficient condition. In Sec. 8.2 Petri 
nets are interpreted as a proper subset of Matrix Graph Grammars. Also, the concept 
of initial marking (minimal initial digraph) is defined and the main concepts of Matrix 
Graph Grammars are revisited for Petri nets. The rest of the chapter enlarges the state 
equation to cope with more general graph grammars. We will make use of the tensor 
notation introduced in Sec. 2.4. First, in Sec. 8.3 for fixed Matrix Graph Grammars 
(grammars with no dangling edges) and Sec. 8.4 for general Matrix Graph Grammars 
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(floating grammars). As in every chapter, we finish with a summary in Sec. 8.5 with 
some further comments, in particular on other problems that can be addressed similarly 
to what is done here for reachability. 

8.1 Crash Course in Petri Nets 

A Petri net (also a Place/Transition net or P/T net) is a mathematical representation of 
a discrete distributed system, [53] . The structure of the distributed system is depicted as 
a bipartite digraph. There arc place nodes, transition nodes and arcs connecting places 
with transitions. A place may contain any number of tokens. A distribution of tokens 
over the places is called a marking. A transition is enabled if it can fire. When a transition 
fires consumes tokens from its input places and places a number of tokens in its output 
places. The execution of Petri nets is non-deterministic, so they are appropriate to model 
concurrent behaviour of distributed systems. More formally, 

Definition 8.1.1 (Petri Net) A Petri net is a 5-tuple PN = (P,T,F,W,M 0 ) where 

• P = {pi, . . . ,p m } is a finite set of places. 

• T = {ti, . . . ,t n } is a finite set of transitions. 

• F c (P x T) u (T x P) is a set of arcs. 

• W : F — > N > 1 is a weight function. 

• M 0 : P — » N is the initial marking. 

• P n T = 0 and P u T # 0. 

The set of arcs establishes the flow direction. A Petri net structure is the 4-tuplc 
N = (P,T,F,W), in which the initial marking is not specified. Normally, a Petri net 
with a initial marking is written PN = (N,Mq). 

Algebraic techniques for Petri nets are based on the representation of the net with 
an incidence matrix A in which columns are transitions. Element Aj is the number of 
tokens that transition i removes - negative - or adds - positive - to place j. 

One of the problems that can be analyzed using algebraic techniques is reachability. 
Given an initial marking Mo and a final marking Md, a necessary condition to reach Md 
from M 0 is to find a solution x to the equation Md = M 0 + Ax, which can be rewritten 
as the linear system 
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M = Ax. (8.1) 

Solution x - known as Parikh vector - specifies the number of times that each tran- 
sition should be fired, but not the order to fire them. Identity (8.1) is the state equation. 
Refer to [53]. 

The ideas presented up to the end of the section are interpretations of the author and 
should not be considered as standard in the theory of Petri nets. 

The state equation introduces a matrix, which conceptually can be thought of as 
associating a vector space to the dynamic behaviour of the Petri net. It is interesting 
to graphically interpret the operations involved in linear combinations: Addition and 
multiplication by scalars, as depicted in Fig. 8.1. 

The addition of two transitions is again a transition tk = U+ tj for which input places 
are the addition of input places of every transition and the same for output places. If a 
place appears as input and output place in tk, then it can be removed. 

Multiplication by —1 inverts the transition, i.e. input places become output places 
and viceversa, which in some sense is equivalent to disapplying the transition. 




Fig. 8.1. Linear Combinations in the Context of Petri Nets 

One important issue is that of notation. Linear algebra uses an additive notation 
(addition and subtraction) which is normally employed when an abelian structure is 
under consideration. For non-commutative structures, such as permutation groups, the 
multiplicative notation (composition and inverses) is preferred. The basic operation with 
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productions is the definition of sequences (concatenation) , for which historically a multi- 
plicative notation has been chosen, but substituting composition "o" by the concatenation 
";" operation. 1 

From a conceptual point of view, we are interested in relating linear combinations 
and sequences of productions. 2 Note that, due to commutativity, linear combinations do 
not have an associated notion of ordering, e.g. linear combination PV\ = pi + 2p 2 + ps 
coming from Parikh vector [1, 2, 1] can represent sequences Pi;P2',P3',P2 or p2] P2', P3', Pi , 
which can be quite different. The fundamental concept that deals with commutativity is 
precisely sequential independence. 

Following this reasoning, we can find the problem that makes the state equation 
a necessary but not a sufficient condition: Some transition can temporarily owe some 
tokens to the net. The Parikh vector specifies a linear combination of transitions and 
thus, negatives are temporarily allowed (substraction). 

Proposition 1 Sufficiency of the state equation can only be ruined by transitions tem- 
porarily borrowing tokens from the Petri net. 

Proof 

□If there are enough tokens in every places then the transitions can be fired (equiv., 
productions can be applied). In this case the state equation guarantees reachability. A 
negative number of tokens in one place (temporarily) represents a coherence problem in 
the sequence. Note that due to the way in which Petri nets are defined there can not be 
compatibility issues, hence reachability depends exclusively on coherence. ■ 

In the proof we have used Matrix Graph Grammars concepts such as sequences and 
coherence. Notice that we have not stated how a Petri net is coded in Matrix Graph 
Grammars. This point is addressed in Sec. 8.2. 

Proposition 1 does not provide any criteria based on the topology of the Petri net, 
as theorems 16, 17, 18 and corollaries 2 and 3 in [53], but contains the essential idea in 

1 This is the reason why Chap. 4 introduces ";" to be read from right to left, contrary to the 
Graph Transformation Systems literature. 

2 Linear combinations are the building blocks of vector spaces, and the structure to be kept by 
matrix application. 
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their proofs: The hypothesis in previously mentioned theorems guarantee that cycles in 
the Petri net will not ruin coherence. 



8.2 MGG Techniques for Petri Nets 

In this section we will brush over some of the most important concepts developed so far 
for Matrix Graph Grammars and see how they can be applied to Petri nets. Given a 
Petri net, we will consider it as the initial host graph in our Matrix Graph Grammar. 

One production is associated to every transition in which places and tokens are nodes 
and there is an arrow joining each token to its place. In fact, we represent places for 
illustrative purposes only as they are not strictly necessary (including tokens alone is 
enough). Figure 8.2 shows an example, where production pi corresponds to transition 
The firing of a transition corresponds to the application of a rule. 
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Fig. 8.2. Petri Net with Related Production Set 



Thus, Petri nets can be considered as a proper subset of Matrix Graph Grammars 
with two important properties: 

1. There are no dangling edges when applying productions (firing transitions). 
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2. Every production can only be applied in one part of the host graph. 

Properties (1) and (2) somehow allow us to safely "ignore" matchings as introduced in 
Chap. 5. In addition, we consider Petri nets with no self-loops. 3 Translating to Matrix 
Graph Grammars, this means that one production either adds or deletes nodes of a 
concrete type, but there is never a simultaneous addition and deletion of nodes of the same 
type. This agrees with the expected behaviour of Matrix Graph Grammars productions 
with respect to nodes (which is the behaviour of edges as well, see Sec. 4.1) and will 
be kept throughout the present chapter, mainly because rules in floating grammars are 
adapted depending on whether a given production deletes nodes or not (refer to Sec. 
8.4). 

It is advisable that elements are not relative integers. A number four must mean that 
production p adds four nodes of type x and not that p adds four nodes more than it 
deletes of type x. If we had one such production p, a possible way to proceed is to split p 
into two rules, one performing the addition actions, p r , and the other the deletion ones, 
p e . Sequentially, p should be decomposed as p = p r ;p e - 

Minimal Marking. The concept of minimal initial digraph can be used to find the 
minimum marking able to fire a given transition sequence. For example, Fig. 8.3 shows 
the calculation of the minimal marking able to fire transition sequence ^5^3;^ (from 
right to left). Notice that (fi^i) v (riZ^X^-^) v • ■ ■ v (r\L n ) ■ ■ ■ (r n L n ) is the expanded 
form of equation (4.46). The formula is transformed according to [1 2 3] 1 — > [13 5]. 

p 5 tk 4 m p, tk s m tkp tk s m J 

Fig. 8.3. Minimal Marking Firing Sequence t$\tz\t\ 

Reachability. The reachability problem can also be expressed using Matrix Graph 
Grammar concepts, as the following definition shows. 

3 Petri nets without self-loops are called pure. Petri nets. A place p and a transition t are on a 
self-loop if p is both an input and an output place of t. 
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Definition 8.2.1 (Reachability) For a grammar © = (Mo, {pi, . . . ,p n }), a state Md 
is called reachable starting in state Mq, if there exists a coherent concatenation made up 
of productions Pi e 0 with minimal initial digraph contained in Mo and image in Md- 

This defmtion will be used to extend the state equation from Petri nets to Matrix 
Graph Grammars. 

Compatibility and Coherence. As pointed out in the proof of proposition 1, there 
can not be compatibility issues for Petri nets as no dangling edge may ever happen. 
Coherence of the sequence of transitions firing implies applicability (problem 1). It will 
be possible to unrelate problematic nodes (make the sequence coherent) if there are 
enough nodes in the current state, which eventually depends on the initial marking. 

8.3 Fixed Matrix Graph Grammars 

In this and next sections we will be concerned with the generalization of the state equation 
to wider types of grammars. 

Recall from Sec. 5.1 that by a fixed Matrix Graph Grammar we understand a grammar 
as introduced in Chap. 4, but in which rule application is not allowed to generate dangling 
edges, i.e. any production p that deletes a node but not all of its incoming and outgoing 
edges can not be applied. In other words, operator T e is forced to be the identity. Property 
2 of Petri nets (see Sec. 8.2, p. 213) is relaxed because now a single production may 
eventually be applied in several different places of the host graph. 

The approach of this section can be used with classical Double Pushout graph gram- 
mars [22]. However, following the discussion after proposition 4.1.4, we restrict to DPO 
rules in which nodes (or edges) of the same type are not rewritten (deleted and created) 
in the same rule. 

In order to perform an a priori analysis it is mandatory to get rid of matches. To 
this end, either an approach as proposed in Chaps. 4 and 5 is followed (as we did in Sec. 
8.2), or types of nodes are taken into account instead of nodes themselves. The second 
alternative is chosen so productions, initial state and final state are transformed such 
that types of elements are considered, obtaining matrices with elements in Z. Notice that 
this abstraction provokes some information loss. 
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Tensor notation will be used in the rest of the chapter to extend the state equation. 
Although it will be avoided whenever possible, four indices may be used simultaneously, 
jfA* fe . Top left index indicates whether we are working with nodes (N) or with edges (E). 
Bottom left index specifies the position inside a sequence, if any. Top right and bottom 
right are contravariant and covariant indices, respectively, where k = k 0 is the adjacency 
matrix (with types of elements, as comented above) corresponding to production pk 0 ■ 

Definition 1 Let (3 = (oM, {pi, . . . ,p n }) be a fixed graph grammar and m the number 
of different types of nodes in (3. The incidence matrix for nodes N A = (A£.) where 
i e {1, . . . , n} and k e {1, . . . , m] is defined by the identity 

^ J +r if production k adds r nodes of type i ^ 
) — r if production k deletes r nodes of type i 

It is straightforward to deduce for nodes an equation similar to (8.1): 

n 

?M* = />f + 2 N Alx k . (8.3) 
fe=i 



The case for edges is similar, with the peculiarity that edges are represented by 
matrices instead of vectors and thus the incidence matrix becomes the incidence tensor 
E A-j k . Again, only types of edges, and not edges themselves, are taken into account. Two 
edges e\ and e-i are of the same type if their starting nodes are of the same type and 
their terminal nodes are of the same type. 

Source nodes will be assumed to have a contravariant behaviour (index on top, i) while 
target nodes (first index, j) and productions (second index, k) will behave covariantly 
(index on bottom). See diagram on the center of Fig. 8.5. 

Example. nSome rules for a simple client-server system are defined in Fig. 8.4. There 
are three types of nodes: Clients (C), servers (S) and routers (R), and messages (self-loops 
in clients) can only be broadcasted. 

In the Matrix Graph Grammar approach, this transformation system will behave as 
a fixed or floating grammar depending on the initial state. Note that production p4 adds 
and deletes edges of the same type (C, C). For now, the rule will not be split into its 
addition and deletion components as suggested in Sec. 8.2. See Sec. 8.4.1 for an example 
of this. 
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Fig. 8.4. Rules for a Client-Server Broadcast-Limited System 

Incidence tensor (edges) for these rules can be represented componentwise, each com- 
ponent being the matrix associated to the corresponding production. 



Eai 



"A 



J'3 



0 0 0 
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0 0 1 


R 


0 1 0 


s 


0 2 0 


c 


2 0 0 


R 


0 0 0 


s 



L A 



j 2 



E A 



J4 



0 


-2 0 


c 


-2 


0 -1 


R 


0 


-1 0 


S 




1 0 0 


C 




0 0 0 


R 




0 0 0 


S 



Columns follow the same ordering [C R S]M 

Lemma 8.3.1 With notation as above, a necessary condition for state jM to be reachable 
from state qM is 



d M - 0 M 



S M!. 



2^4 = 2(^)5 



(8.4) 



k = l 



k = l 
p—k 



where i, j e {1, . 



Last equality in equation (8.4) is the definition of inner product - see Sec. 2.4 - so 
we further have: 

d M-oM = ( E A,x). (8.5) 

Proof 

□Consider the construction depicted on the center of Fig. 8.5 in which tensor A^ k is 
represented as a cube. Setting k = kg fixes production pk 0 ■ A product for this object is 
defined in the following way: Every vector in the cube perpendicular to matrix x acts 
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on the corresponding row of the matrix in the usual way, i.e. for every fixed i = io and 
3=30 in (8.4), 




Fig. 8.5. Matrix Representation for Nodes, Tensor for Edges and Their Coupling 



Every column in matrix x is a Parikh vector as defined for Petri nets. Its elements 
specify the amount of times that every production must be applied, so all rows must be 
equal and hence equation (8.6) needs to be enlarged with some additional identities: 

n 

• 3 (8-7) 

V X p - X q 

with p, q e {l,...,m}. This uniqueness together with previous equations provide the 
intuition to raise (8.4). 

Informally, we are enlarging the space of possible solutions and then projecting ac- 
cording to some restrictions. To see that it is a necessary condition suppose that there 
exists a sequence s n such that s n (oM) = and that equation (8.6) does not provide 
any solution. Without loss of generality we may assume that the first column fails (the 
one corresponding to nodes emerging from the first node), which produces an equation 
completely analogous to the state equation for Petri nets, deriving a contradiction. ■ 

Example (Cont'd). nLet's test whether it is possible to move from state So to state 
Sd (see Fig. 8.6) with the productions defined in Fig. 8.4 on p. 217. Matrices for the 
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1:S • 2:C° 

l:C • 

1:R • 3:C 



l :R It 3:C 



Fig. 8.6. Initial and Final States for Productions in Fig. 8. 4 
states (edges only) and their difference are: 



1 0 0 


c 


; ^Sd = 


3 1 0 


c 


0 0 0 


R 


1 0 1 


R 


0 0 0 


s 




0 1 0 


S 



En _ Eq Eq 
O — Dd ~ >->0 



2 1 0 


c 


1 0 1 


R 


0 1 0 


S 



The proof of proposition 8.3.3 poses the following matrices, where the ordering on 
rows and columns is [C R S]: 



Ik 



0 


0 0 1 


• E A l - 


0 


-2 2 0 


• E A { - 


0 


0 0 0 


0 


-2 2 0 


0 


0 0 0 


1 


-10 0 


0 


0 0 0 




1 


-10 0 


0 


0 0 0 



These matrices act on matrix x = (%g), P e {1, 2, 3, 4}, q e {1, 2, 3} to obtain: 

t E A lk x\ = 



fe=i 



0 



E S 2 = 2 E Mkx k 2 



fe=l 



x\ 



(8.8) 



E S 3 



fe=l 



Recall that x must satisfy: 



<3- 



3 ' 



A contradiction is derived for example with equations x 
and 1 = -2x1 + 



x 2' 1 



X3, X2 
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Remark, nlf there is no development tool handy and you need to write equations 
(8.8) it is useful to remember the following rules of thumb: 

• The subscript of S coincides with the subscripts of all x and it is the terminal node 
for edges. Hence, there will be as many equations in Si as types of terminal nodes to 
which modified edges arrive. The first thing to do is a list of these nodes. 

• For a fixed Sj, there will be as many equations in the vector of variables as initial 
nodes for modified edges. The terminal node is j in this case. 

• The superscript of x is the production. To derive each equation just count how many 
edges of the same type are added and deleted and sum up. 

for a larger example see Sec. A.4. ■ 

It is straightforward to derive a unique equation for reachability which considers 
both nodes and edges, i.e. equations (8.3) plus (8.4). This is accomplished extending the 
incidence matrix M from M : E — » E to M : E x N — > E (from M mxm to M mx ( m+ i)), 
where column m + 1 corresponds to nodes. 

Definition 8.3.2 (Incidence Tensor) Let © = (qM, {pi, . . . ,p n }) be a Matrix Graph 
Grammar. The incidence tensor Aj k with ie {1, . . . , m} and j e {1, . . . ,m + 1} is defined 
by (84) ifl^j^m and by (8.3) if j =m + l. 

Top left index in our notation works as follows: N A refers to nodes, E A to edges and A 
to their coupling. Note that a similar construction can be carried out for productions if it 
was desidered to consider nodes and edges in a single expression. Almost all the theory as 
developed so far would remain without major notational changes. The exception would 
probably be compatibility, which would need to be rephrased. 

An immediate extension of lemma 8.3.1 is: 

Proposition 8.3.3 (State Equation for Fixed MGG) Let notation be as above. A 
necessary condition for state to be reachable (from state qM ) is: 



n 




(8.9) 



k=i 



Proof 
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Equation (8.9) is a generalization of (8.1) for Petri nets. If there is just one place 
of application for each production then the state equation as stated for Petri nets is 
recovered. 

8.4 Floating Matrix Graph Grammars 

Our intention now is to relax the first property of Petri nets (Sec. 8.2, p. 213) and allow 
production application even though some dangling edge might appear (see Chap. 5). 
The plan is carried out in two stages which correspond to the subsections that follow, 
according to the classification of e-productions in Sec. 5.4. 

In Matrix Graph Grammars, if applying a production po causes dangling edges then 
the production can be applied but a new production (a so-called e-production) is created 
and applied first. In this way a sequence po;p £ o is obtained with the restriction that p £ o 
is applied at a match that includes all nodes deleted by po- See Chap. 5 for details. 

Inside a sequence, a production p 0 that deletes an edge or node can have an external or 
internal behaviour, depending on the identification carried out by the match. Following 
Chap. 5, if the deleted element was added or used by a previous production the production 
is labeled as internal (according to the sequence). On the other hand, if the deleted 
element is provided by the host graph and it is not used until po's turn, then p 0 is an 
external production. 

Their properties are (somewhat) complementary: While external e-productions can 
be advanced and composed to eventually get a single initial production which adapts the 
host graph to the sequence, internal e-productions are more static 4 in nature. On the 
other hand, internal e-productions depend on productions themselves and are somewhat 
independent of the host graph, in contrast to external e-productions. Note however that 
internal nodes can be unrelated if, for example, matchings identified them in different 
parts of the host graph, thus becoming external. 

4 Maybe it is possible to advance their application but, for sure, not to the beginning of the 
sequence. 
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8.4.1 External e-production 

The main property of external e-productions, compared to internal ones, is that they act 
only on edges that appear in the initial state, so their application can be advanced to the 
beginning of the sequence. In this situation, the first thing to know for a given Matrix 
Graph Grammar (3 = (oM, {pi, . . . ,p n }) - with at most external e-productions - when 
applied to oM is the maximum number of edges that can be erased from its initial state. 
The potential dangling edges (those with any incident node to be erased) are given by 

n 

* = V (F®F) ( 8 - 10 ) 

k = l 

This is the nihilation matrix introduced in Sec. 4.4.2, in particular in lemma 4.4.5. 
The notation is changed in order to distinguish N of nodes from N of nihilation. 

Proposition 8.4.1 A necessary condition for state dM to be reachable (from state $M ) 
is: 

n 

with the restriction $M e ^ 6- ^ 0. 
Proof (Sketch) 

□According to Sec. 5.4, all e-productions can be advanced to the beginning of the sequence 
and can be composed to obtain a single production, adapting the initial digraph before 
applying the sequence, which in some sense interprets matrix b as the production number 
n+1 in the sequence (the first to be applied). Because it is not possible to know in advance 
the order of application of productions, all we can do is to provide bounds for the number 
of edges to be erased. This is in essence what b does.B 

Note that equation (8.9) in proposition 8.3.3 is recovered from (8.11) if there are no 
external e-productions. 

Example. ^Consider the initial and final states shown in Fig. 8.7. Productions of 
previous examples are used, but two of them are modified (p 2 and p 3 ). 

In this case there are sequences that transform state 0-5* in dS, for example, S4 = 
P4]p' 3 ;pi;p' 2 . Note that the problems are in edges (1 : S, 1 : R) and (1 : C, 1 : R) of the 
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Fig. 8.7. Initial and Final States (Based on Productions of Fig. 8.4) 



intial state: Router 1 is able to receive packets from server 1 and client 1, but not to send 
them. 

Next, matrices for the states and their difference are calculated. The first three 
columns correspond to edges (first to clients, second to routers and third to servers) 
and fourth to nodes which has been split by a vertical line for illustrative purposes only. 
The ordering of nodes is [C R S] both by columns and by rows. 
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The incidence tensors for every production (recall that P2 and ps are as in Fig. 8.7) 
have the form 
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Although it does not seem to be strictly necessary here, more information is kept and 
calculations are more flexible if production p4 is split into the part that deletes messages 



A i+ 



and the part that adds them, p4 = 
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As in the example of Sec. 8.3, the following matrices are more appropriate for calcu- 
lations: 



~>3k " 



o o o -: 

0 0 10 

ooo o 



^2k 



0 0 0 0 0 

1 0 0 0 0 
0 0 0 0 0 



4 k 



0 0 10 0 

0 0 0 0 0 

1 0 0 0 0 

0 0 0 0 0 
1-10 0 0 
0 0 0 0 0 



If equation (8.9) was directly applied, we would get x 1 = 0 and x 1 = 1 (third row of 
A^ k and second of A^ k ) deriving a contradiction. The variations permitted for the initial 



state are given by the matrix 



oMc 



0 a^00 
a? 0 00 
0 a\ 0 0 



with a\ e {0, -1}, a\, a% e {0, -1, -2}. Setting b\ 
and one edge (C, R) removed) the system to be solved is 



1 and b 3 



(8.12) 

-1 (one edge (S,R) 



110 0 




-x i +2x 4 x 3 


0 


0 


10 10 




x 3 0 


x 1 


x 1 - x 2 


0 10 0 




0 x 1 


0 


0 



with solution x 1 = x 2 = x 3 = x = 1, S4 being one of its associated sequences. Note that 
the restriction in proposition 8.4.1 is fulfilled, see equation (8.12).B 

In previous example, as we knew a sequence (S4) answer to the reachability problem, 
we have fixed matrix b directly to show how proposition 8.4.1 works. Although this 
will not be normally the case, the way to proceed is very similar: Relax matrix M by 
substracting b, find a set of solutions {x, b} and check whether the restriction for matrix 
b is fulfilled or not. 



8.4.2 Internal £-production 

Internal e-productions delete edges appended or used by productions preceding it in the 
sequence. In this subsection we first limit to sequences which may have only internal 
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£-productions and, by the end of the section, we will put together proposition 8.4.1 from 
Subsec. 8.4.1 with results derived here to state theorem 8.4.3 for floating Matrix Graph 
Grammars. 

The proposed way to proceed is analogous to that of external £-productions. The 
idea is to allow some variation in the amount of edges erased by every production, but 
this variation is constrained depending on the behaviour (definition) of the rest of the 
rules. Unfortunately, not so much information is gathered in this case and what we are 
basically doing is ignoring this part of the state equation. 

Define h] k = ^A* k (e®I fe )j = max (A(e (x) I), 0), where vector I k = [1, . . . , l](i,fc)- 5 

Proposition 8.4.2 A necessary condition for state to be reachable (from state oMj 
is: 

n 

M i = Y i {^k+V)x k (8.13) 
fc=i 

with the restriction h l - k ^V-\ ^ 0. 

Proof 
□■ 

In some sense, external e-productions are the limiting case of internal e-productions 
and can be seen almost as a particular case: As e-productions do not interfere with 
previous productions they have to act exclusively on the host graph. 

The full generalization of the state equation for non-restricted Matrix Graph Gram- 
mars is given in the next theorem. 

Theorem 8.4.3 (State Equation) With notation as above, a necessary condition for 
state dM to be reachable (from state oMj is 

n 

M / = Z (4* + *0 + ^ (8.14) 

k=l 

with bj satisfying restrictions specified in proposition 8.4-1 and V satisfying those in 
proposition 8.4-2. 

5 e(x)I(fc) defines a tensor of type (1,2) wich "repeats" matrix e "k" times. 
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Proof 
□■ 

One interesting possibility of 8.14 is that we can specify if productions acting on some 
edges must have a fixed or floating behaviour, depending whether variances are permitted 
or not. 

Strengthening hypothesis, formula (8.4.3) becomes those already studied for floating 
grammars with internal e-productions (6 = 0), with external e-productions (V = 0), fixed 
grammars (from multilinear to linear transformations) or Petri nets, fully recovering the 
original form of the state equation. 

8.5 Summary and Conclusions 

The starting point of the present chapter is the study of Petri nets as a particular case 
of Matrix Graph Grammars. We have adapted concepts of Matrix Graph Grammars to 
Petri nets, such as initial marking. Next, reachability and the state equation have been 
reformulated and extended with the language of this approach, trying to provide tools 
for grammars as general as possible. 

Matrix Graph Grammars have also benefitted from the theory developed for Petri 
nets: Through the generalized state equation (8.14) it is possible to tackle problem 4. 

Despite the fact that the more general the grammar is, the less information the state 
equation provides, theorem 8.4.3 can be considered as a full generalization of the state 
equation. 

Equation (8.14) is more accurate as long as the rate of the amount of types of nodes 
with respect to the amount of nodes approaches one. Hence, in general, it will be of little 
practical use if there are many nodes but few types. 

Although the use of vector spaces (as in Petri nets) and multilinear algebra is almost 
straightforward, many other algebraic structures are available to improve the results 
herein presented. For example, Lie algebras seem a good candidate if we think of the Lie 
bracket as a measure of commutativity (recall Subsec. 8.1 in which we saw that this is 
one of the main problems of using linear combinations). 
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It should be possible to extend a little the Lie bracket to consider two sequences 
instead of just two productions. 6 With the theory of Chap. 6 the case of one production 
and one sequence can be directly addressed. 

Other Petri nets concepts have algebraic characterizations and can be studied with 
Matrix Graph Grammars. Also, it is possible to extend their definition from Petri nets 
to Matrix Graph Grammars. A short summary of some of them follows: 

• Conservative Petri nets are those for which the sum of the tokens remains constant. 
For example, think of tokens as resources of the problem under consideration. 

• An invariant is some quantity that does not change during runtime. They are divided 
in two main families: Place invariants and transition invariants. 

• Liveness studies whether transitions in a Petri net can be fired. There are five levels 
(LO to LA) with algebraic characterizations of necessary conditions. 

• Boundedness of a Petri net studies the number of tokens in places (in particular if 
this number remains bounded). Sufficient conditions are known. 

Note that reachability can be directly used to study invariance under sequences of 
initial states. If the initial state must not change, set the initial and the final states as 
one and the same. This way, the state equation must be equalized to zero. This is related 
to termination because if there are sequences that leave some state invariant, then there 
are cycles in the execution of the grammar, preventing termination. 

The book finishes in Chap. 9, a summary with further research proposals. Also, the 
main contributions are listed. 

Appendix A presents a full worked out example that illustrates all relevant concepts 
presented in this dissertation in a more or less realistic case. Its main objective is to show 
the use and practical utility of compatibility, coherence, minimal and negative initial 
digraphs, applicability, sequential independence and reachability. In particular properties 
of the system related to problems 1, 3 and 4 are addressed. 



If sequences are coherent, composition can be used to recover a single production. 
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Conclusions and Further Research 



This chapter closes the main body of the book. There are still two appendices. Appendix 
A includes a detailed real world case study in which much of the theory developed so 
far is applied. It is of interest because we can see the theory at a practical level and the 
power of the results. Appendix B includes the ICM'2006 award wining poster in which 
the basics of Matrix Graph Grammars are illustrated. 

This chapter is organized in three sections. In Sec. 9.1 we summarize the theory and 
highlight some topics that can be further investigated with Matrix Graph Grammars as 
developed so far. Section 9.2 lists what in our understanding are the main contributions 
of this book. Finally, Sec. 9.3 exposes a long term program to address termination, 
confluence and complexity from the point of view of Matrix Graph Grammars. 

9.1 Summary and Short Term Research 

In this book we have presented a new theory to study graph dynamics. Also, increas- 
ingly difficult problems of graph grammars have been addressed: Applicability, sequential 
independence and reachability. 

First, two characterizations of action over graphs (also production or grammar rule) 
are defined, one emphasizing its static part and one its dynamics. To some extent it is 
possible to study these actions without taking into account the initial state of the system. 
Hence, information on the grammar can be gathered at design time, being potentially 
useful during runtime. Nodes and edges are considered independently, although related 
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by compatibility. It should be possible, using the tensorial construction of Chap. 8, to 
define a single (algebraic) structure and set compatibility as one of its axioms (a property 
to be fulfilled). 

Sequences of productions are studied in great detail as they are responsible for the 
dynamics of any grammar. Composition, parallelism and true concurrency have also been 
addressed. 

The effect of a rule on a graph depends on where the rule is applied (matching). In 
Matrix Graph Grammars, matches are injective morphisms. As different productions in 
a sequence can be applied at different places non-detcrministically, marking links parts 
of productions guaranteeing their applicability to the same elements. It is possible to 
define both matching and marking as operators acting on productions. 

Production application may have side effects, e.g. the deletion of dangling edges. A 
special type of productions, known as e-productions, appear to keep compatibility. It 
is shown that they are the output of some operator acting on productions as well as 
matching and marking. 1 Operators can be translated into productions of a sequence. 
This new perspective eases their analysis. 

Minimal and negative initial digraphs are respectively generalized to initial and neg- 
ative digraph sets. Two characterizations for applicability are given. One depends on 
coherence and compatibility and the other on minimal and negative initial digraphs. 

Sequential independence is closely related to commutativity, but with the possibility 
to consider more than two elements at a time. This has been studied in the case of one 
production being advanced or delayed an arbitrary (but finite) number of positions. 

One interesting question is whether two sequences need the same initial elements 
or not, especially when one is a permutation of the other. G-congruence and congru- 
ence conditions tackle this point again for one production being advanced or delayed a 
finite number of positions inside a sequence. An interesting topic for further study is 
to get similar results but considering moving blocks of productions instead of a single 
production. 

1 Compatibility is a must. The operator may act appending new e-productions, recovering a 
floating behaviour or it can be "deactivated" getting a fixed behaviour. Throughout this book 
we have focused on floating grammars, which are more general. 
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Graph constraints and particularly application conditions are of great interest, mainly 
for two reasons: First, the left hand side and the nihilation matrix are particular cases, 
and second it is possible to deal with multidigraphs without any major modifications 
of the theory. We have seen that application conditions are a particular case of graph 
constraints and that a graph constraint can be reduced to an application condition in the 
presence of a production. Application conditions can again be seen as operators acting on 
productions. This, once more, means that they are equivalent to sequences of a certain 
kind. 

Any application condition can be transformed into an equivalent (set of) produc- 
tion^). Among many other things, this reduces the study of consistency of application 
conditions to that of applicability. As it is possible to transform preconditions into post- 
conditions and back again, they are in some sense delocalized in a production. Although 
this is sketched in some detail in Chap. 7, no concrete theorem is established concern- 
ing the possibility to move application conditions among productions inside a sequence. 
We do not foresee, to the best of our knowledge, any special difficulty in addressing 
this topic with the theory developed so far. This would be one application of sequential 
independence - problem 3 - to application conditions. 

Finally, in order to consider reachability - problem 4 - Petri nets are presented as 
a particular case of Matrix Graph Grammars. From this perspective, notions of Matrix 
Graphs Grammars like the minimal initial digraph are directly applied to Petri nets. 
Also it is interesting that concepts and results from Petri nets can be generalized to be 
included in Matrix Graph Grammars. Precisely, one example of this is reachability. Some 
other concepts can also be investigated such as liveness, boundedness, etcetera, and are 
left for future work. 

For our research in reachability we have almost directly generalized previous ap- 
proaches (vector spaces) to reachability by using tensor algebra. It is worth studying 
other algebraic structures such as Lie algebras. Also, our study of reachability has not 
taken into account the nihilation matrix nor application conditions, other two possible 
directions for further research. 



232 9 Conclusions and Further Research 



9.2 Main Contributions 

In our opinion, the main contribution of this book is the novelty of the graph grammar 
representation, simple and powerful. It naturally brings in several branches of mathemat- 
ics that can be applied to Matrix Graph Grammars, allowing a potential use of advanced 
results to solve old and new problems: First and second order logics, group theory, tensor 
algebra, graph theory, category theory and functional analysis. 

In this book we have made some contributions that we would like to highlight, together 
with the chapter or section in which they are addressed: 

1. Algebraic characterization of productions: 

• Static and dynamic formulations of grammar rules (Sees. 4.1 and 4.4), which allow 
studying rules independently (to some extent) of any host graph. 

• Compatibility for graphs (Sec. 2.3), rules (Sec. 4.1) and sequences (Sec. 4.5), as 
well as coherence for sequences (Sec. 4.3). 

• Minimal and Negative initial digraphs (together with the nihilation matrix) de- 
fined and characterized for sequences. Sections 4.4.1 and 4.4.2. This is later gen- 
eralized as initial and negative digraph sets in Sec. 5.3. 

• Composition and its relation with concatenation are addressed. Section 4.5. 

2. Rewriting step: 

• The operation of matching is characterized as an operator T e acting on produc- 
tions (extended match). Chapter 5. 

• Matrix Graph Grammars can have a fixed or a floating behaviour, depending on 
operator T e , allowing the definition of e-productions. Side effects are characterized 
as a sequence of productions. Chapter 5. 

• The marking operation can be also characterized as an operator, T M . It makes it 
possible to get rid of non-determinism when splitting productions into sequences. 
Section 5.2. 

• Grammar rule side effects are classified in internal and external e-productions. 
The concept of exact derivation is introduced. Section 5.4. 

• Two alternative characterizations are provided for applicability. Section 5.5. 

3. Analysis of rule independence: 
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• Characterization of the equality between initial digraphs is carried out through 
G-congruence, together with congruence conditions. Section 6.1. 

• Necessary and sufficient conditions for sequential independence of sequences and 
derivations are proved. Sequential independence is extended to cope with the 
simultaneous advance (delay) of an arbitrary finite number of productions inside 
a sequence. Sections 6.2 and 6.3. 

4. Graph constraints and application conditions: 

• Graph constraints and application conditions are introduced using MSOL (Sec. 
7.1). A characterization of consistent, coherent and compatible application con- 
ditions is given (Sec. 7.2). 

• A functional analysis representation of application conditions is given, which al- 
lows establishing an equivalence between application conditions and certain sets 
of sequences (Sec. 7.3). 

• Preconditions can be transformed into equivalent postconditions and viceversa. 
Productions are capable of transforming not only the LHS of a grammar rule, L, 
but also its nihilation matrix N . Section 7.4. 

• Variable nodes can be used as part of the graph constraint or application condition 
(not limited to constant nodes). Matrix Graph Grammars can deal with no major 
modifications with multidigraphs. Section 7.5. 

5. Reachability and relation with Petri net theory: 

• Petri nets can be seen as a particular case of Matrix Graph Grammars. MGG 
theory can be applied to Petri nets. Section 8.2. 

• ft is possible to extend Petri nets results to Matrix Graph Grammars. As an 
example, the state equation for reachability in Petri nets is generalized. Chapter 
8. 
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On the practical side, as Appendix A shows, some tasks need to be automated to ease 
further research. Manipulations can get rather tedious and error prone. The develop- 
ment or improvement of a tool such as AToM 3 would be very valuable. Besides, a good 
behaviour of an implementation of Matrix Graph Grammars is expected. 
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At a more theoretical level we propose to study other three increasingly difficult 
problems: Termination, confluence and complexity. We think that the theory developed 
in this book can be useful. See Fig. 9.1. 



Sequential 
Independence 



Independence 



Sequential 
Confluence 



Confluence 



Applicability 



Termination 



Reachability 



Complexity 



Fig. 9.1. Diagram of Problem Dependencies. 



Termination, in essence, asks whether there is a solution for a given problem (if some 
state is reached). In other branches of mathematics this is the well-known concept of 
existence. Reachability with some improvements can be of help in two directions. Starting 
in some initial state, if for some sequence of productions some invariant state is reached, 
then the grammar can not be terminating (as it enters a cycle as soon as it is reached). 
Second, to test the invariance for a given state (if there exists some sequence that leave 
the graph unaltered), the state equation can also be used by setting equal initial and 
final states. 

If we have a terminating grammar, we may wonder whether there is a single final state 
or more than one: Confluence. In other branches of mathematics this is the well-known 
concept of uniqueness. Sequential independence can be used in this case. 

If a grammar is terminating and confluent, the next natural question seems to be how 
much it takes to get to its final state. This is complexity, which can also be addressed 
using Matrix Graph Grammars. It is not difficult to interpret Matrix Graph Grammars 
as a new model of computation, just as Boolean Circuits [76] or Turing machines [57]. 
This is currently our main direction of research. 
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Notice that there are two properties that make Matrix Graph Grammars differ from 
standard Turing machines: Its potential non-uniformity (shared with Boolean Circuits) 
and the use of an oracle, in its strongest version, whose associated decision problem is 
NP-complete. 

Non- uniformity is widely addressed in the theory of Boolean Circuits. The same ideas 
possibly with some changes can be applied to Matrix Graph Grammars. 

The strongest version of Matrix Graph Grammars as introduced here use an oracle 
whose associated decision problem is NP-complete: The subgraph isomorphism problem, 
SI, to match the left hand side of a production in the host graph. If problems that need to 
distinguish lower level complexity classes (assuming P^NP) such as P are considered, 
it is possible to restrict ourselves to some proper submodel of computation. For example, 
the match operation can be forced to use GI instead. 2 

Limitations on matching are not the only natural submodels of Matrix Graph Gram- 
mars. The permitted operations can be constrained, for example forbidding the addition 
and erasing of nodes (this would be closely related to non-uniformity and the use of a 
Gl-complete problem rather than SI). Also, we can act on the set of permitted graphs to 
derive submodels of computation. For example, consider only those graphs with a single 
incoming and a single outgoing edge in every node. 3 



2 GI, Graph Isomorphism, is widely believed not to be NP-complete, though this is still a 

conjecture. Problems that can be reduced to GI define the complexity class GI. 

3 By the way, what standard and very well known mathematical structure is isomorphic to 
these graphs?. 



A 



Case Study 



This Appendix presents a full worked out example that illustrates many of the concepts 
and results of this book (more conceptual aspects such as functional representations, 
adjoints and the like are omitted in this appendix). Although the aim of Matrix Graph 
Grammars is to be a theoretical tool for the study of graph grammars and graph trans- 
formation systems, we will see that it is also of practical interest. 

The case study herein presented tries to be simple enough to be approached with 
paper and pencil but complex enough to look realistic. 

As will be noticed throughout this chapter, Matrix Graph Grammars (as well as any 
approach to graph transformation) encourages the definition of a particular language to 
solve a particular problem. These are known as Domain-Specific languages (DSL). See 
[35]. 

Section A.l presents an assembly line with four types of machines (assembler, dis- 
assembler, quality and packaging), one or more operators and some items to process. 
Section A. 2 presents some sequences and derivations, together with possible states of 
the system. Section A. 3 tackles minimal and negative initial digraphs and G-congruence. 
As we progress, the example will be enlarged to be more detailed. Section A. 6 returns 
to derivations, adding and modifying productions. Section A. 4 deals with applicability, 
sequential independence, reachability and confluence. Graph constraints and applica- 
tion conditions are exemplified in Sec. A. 5. Dangling edges and their treatment with 
e-productions will show up throughout the case study. 
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A.l Presentation of the Scenario 

In this section our sample scenario is set up. Some basic concepts will be illustrated: 
Matrix representation of graphs and productions (Sec. 4.1), compatibility (Sees. 2.3, 4.1 
and 4.5), completion (Sec. 4.2) and the nihilation matrix (Subsec. 4.4.2). 

Our initial assembly line will consist of four machines that take as input one or more 
items and output one or more items. Depending on the machine, items are processed 
transforming them into other items or some decission is taken (reject, accept items), 
with no modification. 

There are four types of items, iteml - item4. One assembly machine (named 
assembler, connected to two input conveyors) processes one piece of iteml and one 
piece of item2 to output in another conveyor one piece of type item3. There is a quality 
assurance machine - quality - that checks if item3 fulfills certain quality standards. If 
it does, then item3 is accepted and packed to further produce item4 through a packaging 
machine. On the contrary, it is rejected and recycled thorugh machine disassembler. 
Elements are graphically represented in Fig. A.l. 



g ♦ 3 a m 

iteml item2 item3 item4 conveyor 




assembler disassembler quality packaging operator 



Fig. A.l. Graphical Representation of System Actors 

In this case study types are those in Fig. A.l. There can be more than one element of 
each type, e.g there are six elements of type conveyor in Fig. A. 6, which shows a snapshot 
of the state of an example of assembly line. For typing conventions refer to comments on 
the example of page 75. 

Note that for now conveyors have infinite load capacity, elements in a conveyor are not 
ordered and one operator can simultaneously manage two or more machines. It should 
be desiderable that one operator might look after different machines but only one at a 
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time. This can be guaranteed only with graph constraints although if the initial state 
fulfills this condition and productions observe this fact, there should not be any problem. 
We will return to this point in Sec. A. 5. 




Fig. A. 2. DSL Syntax Specification 

When dealing with DSLs, it is customary to specify its syntax through a meta-model. 
We will restrict connections among the different actors of the system: 

• Operators can only be connected to machines (by the end of Sec. A. 2 this will be 
relaxed). 

• Items can only be connected to conveyors (until Sec. A. 5 in which they will be allowed 
to be connected to other items). 

• Conveyors can only be connected to machines or to other conveyors. 

• Machines can be connected only to conveyors (by the end of Sec. A. 2 this will be 
relaxed). 

These restrictions have a natural graph representation (see Fig. A. 2), which is some- 
times referred as typed graphs, [10]. Notice that for simplicity all actual types have 
been omitted. For example, there should be four nodes for the different types of items 
(iteml, . . . , item4), and the same for the machines. 

Now we describe the actions that can be performed. These are the grammar rules. 
The state machine will evolve according to them. See Fig. A. 3 for the basic productions. 
We will enlarge or amend them and add some others in future sections. 

Machines are not fully automatic so in this four productions one operator is needed. 
The four basic actions are assemble, disassemble, certify and pack. They correspond to 
productions assemble, recycle, certify and pack. Identifications are obvious so they 
have not been made explicit (numbers between different productions need not be related, 
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Fig. A. 3. Basic Productions of the Assembly Line 

i.e. 1 : conv in production assem and 1 : conv in certify can be differently identified in 
a host graph). 

There are four rules that permit operators to change from one machine to another. 
This movement is cyclic (to make the grammar a little bit more interesting) . A practical 
justification could be that the responsible of the department obligues every operator 
passing near a machine to check if there is any task pending, attending it just in case. 
We will start with a single operator to avoid collapses. See grammar rules move2A, move2Q, 
move2P and move2D in Fig. A. 4. 




Fig. A. 4. Productions for Operator Movement 



A.l Presentation of the Scenario 241 



The last set of productions specify machines and operators break-down (the 'b' in 
front of the productions). Fortunately for the company they can be fixed or replaced 
(the 'f in front of the productions). See Fig. A. 5 for the productions, where as usual 0 
stands for the empty graph. In order to save some space we have summarized four rules 
(one per machine) substituting the name of the machine by an X. This is notationally 
convenient but we should bear in mind that there are four rules for machines break down 
(bMachA, bMachQ, bMachP and bMachD) and another four for machines fixing (fMachA, 
fMachQ, fMachP and fMachD). Also, they can be thought of as abstract rules 1 or variable 
nodes as in Sec. 7.5. The total amount of grammar rules up to now is twenty. 
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Fig. A. 5. Break-Down and Fixing of Assembly Line Elements 



Here we face the problem of e-productions for the first time. If a conveyor with two 
items breaks (disappears) due to rule bConv, there will be at least two dangling edges, 
one from its input machine and another to its output machine. These dangling edges 
could be avoided defining one production per conveyor that takes them into account. 
If the conveyor had any item, then the corresponding edge would also dangle. Again 
this can be avoided if there is a limit in the number of pieces that a conveyor can 
carry, but a rule for each one is again needed. 2 Another possibility for DPO-like graph 
transformation systems (what we have called fixed graph grammars) is to define a sort of 
subgrammar that takes care of potential dangling edges. This subgrammar productions 
would be applied iteratively until no edge can dangle. This is a characteristic of fixed 

1 See reference [46]. 

2 A rule for the case in which a conveyor has one item, another for the case in which the 
conveyor has two items, etcetera. 
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graph tranformation systems and in some situations can be a bit annoying. If there 
is no limit to the number of items (or the limit is too high, e.g. a memory stack in 
a CPU RAM), it is possible to use fixed graph grammars only to some extent. Thus, 
e-productions are useful - at times essential - from a practical point of view, among 
other things, to decrease the number of productions in a grammar (this probably eases 
grammar definition and maintenance and increases runtime efficiency). 

Matrix representation of these rules is almost straightforward according to Sec. 4.1. 
We will explicitcly write the static (left and right hand sides) and dynamic representations 
(deletion, addition and nihilation matrices) of production assemble. 

Elements are ordered [1 : iteml 1 : item2 1 : conv 2 : conv 3 : conv 1 : macA 1 : op] for 
L E sscm and L^, sem , i.e. element (1,3) of matrix L E ssem is the edge that starts in node 
(1: iteml) and ends in first conveyor, (l:conv). The ordering for productions R^ sem 
and i?^ sem is [l:item3 1 : conv 2: conv 3: conv l:macA 1 : op] . Numbers in front of 
types are a means to distinguish between elements of the same type in a given graph 
(these are the numbers that appear in Fig. A. 3). 
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For e E , e N , r E and r N we have the same ordering of elements. 



0 0 10 0 0 0 

0 0 0 10 0 0 

0 0 0 0 0 0 0 

0 0 0 0 0 0 0 

0 0 0 0 0 0 0 

0 0 0 0 0 0 0 

0 0 0 0 0 0 0 



0 0 0 10 0 

0 0 0 0 0 0 

0 0 0 0 0 0 

0 0 0 0 0 0 

0 0 0 0 0 0 

0 0 0 0 0 0 



The production is defined R = p(L) = rveL both for edges and for nodes. To operate 
it is mandatory to complete the matrices. See equation (A. 2) for the implicit ordering of 
elements. 
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(A.l) 



The expression for nodes is similar. As pointed out in Sec. 8.5, using a similar con- 
struction to that of Sec. 8.3 (in the definition of the incidence tensor 8.3.2) it should be 
possible to get a single expression for both nodes and edges instead of a formula for edges 
and a formula for nodes. This might be of interest for implementations of Matrix Graph 
Grammars as more compact expressions are derived. 

We will normally concentrate on edges because they define matrices instead of just 
vectors and all problems such as inconsistencies (dangling elements) come this way. 
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(A.2) 
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Note that some elements in the node vectors are zero. This means that these nodes 
appear in the algebraic expressions but are not part of the graphs. 

The nihilation matrix in this case includes all edges incident to any node that is 
deleted plus edges that are added by production assem. See lemma 4.4.5 for its calculation 
formula: 
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(A.3) 



Let's consider sequence b0p;assem to see how formula (2.4) works to check compati- 
bility (propositions 2.3.4 and 4.1.6). We can foresee a problem with edge (1 : op, 1 : machA) 
because the node disappears but not the edge. 

According to (4.65) we need to check compatibility for the increasing set of sequences 
si = assem and s2 = b0p;assem. Note that the minimal initial digraph is the same 
for both sequences and coincides with the left hand side of assem. Sequence assem 
is compatible, as the output of production assem is a simple digraph again, i.e. rule 
assemble is well defined: 
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Thus, there is no problem with si. Let's check S2 out. Operations are also easy for it. 
Note that r b0p v e b0p (R E sssm ) ~ 1 " 
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This kind of formulas do not only assert compatibility for the sequence, but also tell 
us which elements are problematic. In previous equation we see that the final answer is 
1 because of element in position (7, 8) (bolded). 

In our case study as defined up to now, compatibility can only be ruined by produc- 
tions starting with a 'b' (bOp, etcetera). Either an e-production is appended or the result 
is not a simple digraph (not a graph, actually). Some information about compatibility 
can be gathered at design time, on the basis of required elements appearing on the left 
hand side of the productions, or elements added. For example, according to productions 
considered so far any operator is connected to some machine so if production bOp is 
applied it is very likely that some dangling edge will appear. Nihilation matrices can be 
automatically calculated as well as completion of rules with respect to each other. 

A typical snapshot of the evolution of our assembly line can be found in Fig. A. 6. It 
will be used in future sections as initial state. 
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Fig. A. 6. Snapshot of the Assembly Line 



A.2 Sequences 

One topic not addressed in previous sections is how rules in a graph grammar are selected 
for its application to an actual host graph. There are several possibilities. To simplify 
the exposition rules will be chosen randomly. As commented in Sees. 5.1 and 7.5, this is 
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the first - out of two - source of non-determinism in graph transformation systems, in 
particular in Matrix Graph Grammars. 

We will add another rule - reject - that discards one element once it has been 
assembled. It is represented in Fig. A. 7. 




Fig. A. 7. Graph Grammar Rule reject 



We have two comments on this rule. First, reject does not need the presence of an 
operator to act, but it may also be applied if an operator is on the machine. Second, 
if grammar rules are applied randomly following some probability distribution, elements 
will be rejected according to the selected probability measure. 

Let's begin with one sequence that starts with one piece of type iteml and one of 
type item2 and produces one of type item4: 

s 0 = pack; certify ; assem (A. 4) 

which is compatible as no production generates any dangling edge. Recall that compati- 
bility also depends on the host graph: If iteml was connected to two different conveyors 
(should this make any sense) then rule assem would produce one dangling edge. 

The minimal initial digraph of s 0 can be calculated using (4.46), M So = Vf (r^L y ), 
where order of nodes is [1: iteml l:item2 l:item3 l:item4 1 : conv 2:conv 3:conv 
4:conv 5: conv l:machA l:machQ l:machP 1 : op] . The completion we have performed 
identifies operators in the productions as being the same. Also, element 1 : conv in rule 
certify (Fig. A. 3) becomes 3: conv and 2: conv is now 4: conv. Similar manipulations 
have been performed for pack. Theorem 4.4.2 demands coherence in order to apply (4.46), 
which is checked out in (A. 7). More attention will be paid to initial digraphs in the next 
section. 
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Fig. A. 8. Minimal Initial Digraph and Image of so 
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(A.5) 



The negative initial digraph is calculated using (4.62), N Si 



(exN v ) 



It is not 



shown in any figure because it has many edges. In order to calculate N So , the nihilation 
matrices of productions assem (A. 3), certify and pack are needed. Equation (4.60), 
N = p (D) , can be used with the same ordering of nodes as for M So . 
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The result of applying s 0 to M So is given by (4.55), s 0 = /\f =1 (ef M^j v 

Af (e^ r^j, and can be found on the right of Fig. A. 8. For its calculation, it is possible 
to interpret s 0 as a production according to the remark that appears right after (4.55). 

Sequence s 0 is coherent according to the identifications proposed in its minimal initial 
digraph (Fig. A. 8). To see this (4.42) in theorem 4.3.5 can be used, which once simplified 
is (4.38): 

-^cert^assem V -Zvpack (^assem ^*cert V Ccert) V 

V -Rassem (Scert^pack V ''cert) V -Rcert^pack = 0. {A.7) 

A very simple non-coherent sequence - assuming that both rules act on the same 
elements — is to = reject; certify. It is obvious as both consume the same item. When 
calculating its coherence, not only will we be informed that coherence fails but also what 
elements are responsible for this failure. 

According to Prop. 4.5.3 the rules in s 0 can be composed if they are coherent and 
compatible. Let cq = (L c , e c , r c ) be the rule so defined. Using equations (4.67) and (4.68) 
its matrices can be found. Also, taking advantage of previous calculations for the image 
and according to corollary 4.4.3, we can see that the composition is the one given in Fig. 
A. 9, closely related to Fig. A. 8. 




Fig. A. 9. Composition of Sequence so 



Let mvi = move2A; move2D and mv 2 = move2P; move2Q and define the sequence 
S4 = pack; mv 2 ; assem; mvi. Production pack is not sequentially independent of mvi nor 
of mv 2 ; assem. This is a simple example in which it is possible to advance productions 
inside sequences only if jumps of length strictly greater than one are allowed. To see that 
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pack_L (mv 2 ; assem;mvi) it is necessary - according to theorem 6.2.2 - to test coherence 
of both sequences and G-congruence. 

Coherence for advancement of a single production inside a sequence is given by (6.30) 
in theorem 6.2.3, which should be zero. It is straightforward to check that: 

Gpack Vl (lx L v) V Rpzck Vl (^ r v) = 0- (A. 8) 




Fig. A. 10. DSL Syntax Specification Extended 



By increasing the number of productions the system can be modelled in greater detail. 
For example, one operator can be busy or idle. The operator is busy if some action needs 
his attention. This will be represented by a self loop attached to the operator under 
consideration. The same applies to a machine. The syntax as a DSL of our grammar 
changes because there can exist self-loops for machines and operators. This is not allowed 
in Fig. A. 2. However, negative conditions are needed in the type graph (there can be self- 
loops in machines or operators but not connections between two operators or between 
two machines). See Fig. A. 10. We need to demand A\ for every single edge (using the 
decoposition operator T of Sec. 7.3) and the nonexistence of matchings with A 2 and As. 

Up to now a single operator could be in charge of more than one machine so if there 
are edges from the operator to several machines, all machines may work simultaneously. 
Besides, there can be more than one operator working on the same machine. In a probably 
more realistic grammar, these two scenarios could not take place. These restrictions will 
be addressed in section A. 5. 

The production process of any machine can be split into two phases: If there are 
enough elements to start its job, then the input pieces disappear and the machine and 
the operator become busy. After that, some output piece is produced and the machine 
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b&4 



Fig. A. 11. Production assemble in Greater Detail 

and the operator become idle again. This is represented in the sequence of Fig. A.ll. 
Note that assemble = assemblei o assemble 2 . 

If we limit our Matrix Graph Grammar to deal with simple digraphs we have a built-in 
application condition "for free" . Even though one operator can still be in charge of several 
machines simultaneously, he will manage at most one machine at a time. Otherwise, two 
self-loops would be added violating compatibility. 

Application conditions are needed if productions we want to set restrictions on pro- 
ductions move. This can be permitted if the machine has a kind of "pause" , so the machine 
(which is busy as it has a self loop) can resume as soon as an operator moves to it. It is 
not necessary to specify a restriction to state that a machine can not start a job when 
the operator is busy, as the rule would try to append a second self-loop to the operator 
(something not allowed if we are limited to simple digraphs) . 

Sequences can be generated at design time to debug the grammar or during runtime to 
force a set of events. They can also be automatically generated by application conditions 
or can be associated to other concepts, such as reachability. 

A. 3 Initial Digraph Sets and G-Congruence 

To calculate the initial digraph set of sequence s 0 = pack ; certify ; assem we start with 
the maximal initial digraph Mo, the digraph that unrelatcs all elements for different pro- 
ductions. It is formed by the disjoint union of the left hand sides of the three productions 
in sequence s 0 . The rest of elements Mi of the initial digraph set 9Jt (so) are derived by 
identifying nodes and edges in Mq . These identifications however can not be carried out 
arbitrarily because any Mi e 9Jt(so) must satisfy (4.46). Hence, there are identifications 
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that make some elements unnecessary. For example, if the output conveyor of production 
certify is identified with the input conveyor of pack, then item3 (mandatory for the 
application of pack) is not needed anymore because it will be provided by certify. 




Fig. A. 12. MID and Excerpt of the Initial Digraph Set of s 0 = pack ; certify ; assem 



For so we will label cl and c2 the input conveyors of assemble and c3 its ouput 
conveyor. Similarly, we have c4 and c5 for certify and c6 and c7 for pack. Operators 
will be labelled accordingly so ol is the one in assemble, o2 in certify and o3 in pack. 
There are two machines for packing, ml the one in certify and TO2 in pack. See the 
graph on the left of Fig. A. 13. No identification prevents any other 3 in WI(sq), so the 
number of elements in 971 (sq) grows factorially. In this case, since there are 6 possible 
identifications we have 720 possibilities. In Fig. A. 12 a part of the initial digraph set 
can be found on the right. The string that appears close to each arrow specifies the 
identification (top-bottom) performed to derive the corresponding initial digraph. 

Initial digraph sets can be useful to debug a grammar. By choosing certain testing 
sequences it is possible to automatically select "extreme" cases in which as many elements 

3 For an example in which not all identifications are permitted refer to Sec. 5.3, Fig. 5.7. 
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as possible are identified or unrelated. For example, the development framework can tell 
that a single operator may manage all machines with the grammar as defined so far, but 
maybe this was not the intended behaviour. 




Fig. A. 13. MID for Sequences si and s 2 



G-congruence and congruence conditions guarantee the sameness of the minimal ini- 
tial digraph. They also provide information on what elements are spoiling this property. 
Consider the sequences Si = re j ect ; assemble ; recycle and s 2 = assemble ; recycle ; 
reject, where in s 2 the application of production reject has been advanced two posi- 
tions with respect to Si. The minimial initial digraphs of both sequences can be found 
in Fig. A. 13. By the way, notice that M(si) are invariants for these transformations, i.e. 
(M(si)) = M{si). 

G-congruence is characterized in terms of congruence conditions in theorem 6.1.6. 
Congruence conditions for the advancement of a single production inside a sequence 
are stated in proposition 6.1.2, in particular in (6.22). Simplified and adapted for this 
case with nodes ordered [l:iteml 1 : item2 1 : item3 1 : conv 2:conv 3:conv 4:conv 
l:macA l:macQ l:macD 1 : op] : 4 



4 Where subscript 1 stands for rule recycle, subscript 2 is assemble and subscript 3 reject. 
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The congruence condition fails precisely in those elements that make both minimal 
initial digraph different, (i3,3c) and (i3,4c). See Fig. A. 13. 




Fig. A. 14. Ordered Items in Conveyors 



Relevant matrices in previous calculations can be found in (A. 9) and (A. 10) for rules 
recycle and reject, and in Sec. A.l for assemble, in particular equations (A.l) and 
(A. 3). For identifications across productions see Figs. A. 13 and A. 14. 
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(A.9) 



(A.10) 



A.4 Reachability 

In this section reachability is addressed together with some comments on other problems 
such as confluence, termination and complexity (to be addressed in a future contribution). 

Throughout the book some techniques to deal with sequences have been developed. 
Sequences to be studied have to be supplied by the user. Reachability is a more indirect 
source of sequences, as initial and final states are specified and the system provides us 
with sets of candidates. 





Fig. A. 15. Initial and Final Digraphs for Reachability Example 



We shall use similar initial and final states as those in Fig. A. 8 (see Fig. A. 15). Our 
grammar as defined so far has a fixed behaviour, i.e. it is a fixed graph grammar. The 
state equation for this case is given by (8.9) in Prop. 8.3.3. 
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Let oS and d,S be the initial and final states and the ordering [l:iteml l:item2 
l:item3 l:item4 1 : conv 2:conv 3:conv 4:conv 5:conv 6:conv l:machA l:machQ 
1 : machD 1 : machP 1 : op] . Nodes appear in the last column. 
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(A.11) 



For tensor A^ k only the basic productions assem, certify, reject, recycle and pack 
are considered plus those for operator movement mov2*. Following Sec. 8.2, grammar 
rules that add and delete elements of the same type are splitted in their addition (+) 
and deletion (-) parts. This includes only productions certify and reject. 5 

The set of rules is {assem, certify + , certify - , reject + , reject - , recycle, pack, 
mov2A, mov2Q, mov2D, mov2P}, so k e {1, ...,11}. This ordering is kept in the equations 
that follow. 

The following list summarizes all actions performed by the grammar rules under 
consideration on nodes and edges. A plus sign between brackets means that the element 
is added and a minus sign that it is deleted. 



• (1 : iteml, 1 : conv) 

• (1 : item2, 2 : conv) 

• (1 : item3, 3: conv) 

• (l:item3, 4: conv) 



assem (— ) , recycle (+) 
assem (— ) , recycle (+) 
assem (+) , certify - (— 
certify" 1 " (+) , pack (— ) 



.reject (-) 



Note that neither certify nor reject add or delete the iteml node. They only act on edges. 
These productions are splitted because the edge deleted and the edge added are of the same 
type, (iteml, conv). 
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(1 : item3, 6 : conv) i — > reject" 1 " (+) , recycle (— ) 
(1 : item4, 5 : conv) i — > pack (+) 
(1 : op, 1 :machA) i — > mov2A (+) ,mov2Q (-) 
(1 : op, 1 :machQ) i — > mov2Q (+) , mov2P (-) 
(l:op, l:machD) i — > mov2D(+) ,mov2A(-) 
(l:op, l:machP) i — > mov2P(+) ,mov2D(-) 
(1 : iteml) i — > assem (— ) , recycle (+) 
(1 : item2) i — > assem (— ) , recycle (+) 
(1 : item3) i — > assem (+) , recycle (— ) , pack (— ) 
(1 : item4) i — > pack (+) 

What is finally derived according to the methods proposed in Chap. 8 is a system 
of linear equations. To those arising from the tensor equations another thirteen must be 
appended: 



{x. 



k 

q 

x 2 = x 3 

p q 



X P ~ X <J' 



Xqh V,1 6 {1, • • • , 11} 



First set of equations guarantee that a rule is applied a concrete number of times. 
Second and third equations do not allow inconsistencies for rules certify and reject, 
that have been splitted in their addition and deletion parts. They have to be applied the 
same amount of times. 

Only those columns of M for which some "activity" is defined in the productions are 
of interest, i.e. all except the first four. Zero elements are not included, but substituted 
by bold zeros: 



li 



fc = l 



-x\ + X% 



11 
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*;=i 



x 6 + x 6 
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Mi6 corresponds to nodes. Recall that x must satisfy the additional conditions 



x*, fee {1,.. .,11}. The system has the solution: 



(x, l,l,x -l,x -l,x -l,l,y -l,y,y - l,y) = 0. 



(A.12) 



being s 0 - see equation (A.4) - one of the sequences for x = 1, y = 1. Note that solutions 
are uncoupled in two parts: The one that rules operator movement (y) and that of items 
processing (x). 

This is a good example to study termination and confluence. Any evolution of the 
system having as initial state the one depicted on the left of Fig. A. 15 will eventually 
get to the state on the right of the same figure (termination). 6 The grammar is confluent 
(there is a single solution) although there is no upper bound to the number of steps it 
will take to get to its final state (complexity). Depending on the probability distribution 
there will be more chances to end up sooner or later. Independently of the distribution, 
larger sequences have smaller probabilities, being their probability zero in the limit (if 
the probability assigned to rejecting iteml is different from 1). 



In fact, it is not terminating because the productions that move the operator can still be 
applied. What we would need is another production that drives the system to a halting state. 
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A. 5 Graph Constraints and Application Conditions 

Application conditions and graph constraints will make our case study much more real- 
istic. We will see two examples on how application conditions can be used to limit the 
applicability of rules or to avoid undesired behaviours. 




The first is based on the remark that conveyors as presented so far have infinite 
capacity to load items. Probably either due to a limit of space or of load, conveyors can 
not transport more than, say, two items. This is a constraint on the whole system, which 
can be modelled as a graph constraint as introduced in Chap. 7. Figure A. 16 shows a 
diagram 0q that sets this limit, with associated formula: 



fo = tAi . . . A 6 



A 6 



A * 



(A.13) 



Recall that if the quantifier is not repeated it means that it applies to every term, 
e.g. $A X A 2 = MiMa- 
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Graphs A 5 and A 6 are necessary because rule recycle may mix elements of type 
iteml and item2 in the same conveyor. This graph constraint will be named GCq = 
(fo,fo)- By using variable nodes - see Sec. 7.5 - the diagram and the formula would be 
simpler, similar to the example on page 166, in particular the right side of Fig. 7.4. In 
the end, the diagram and the formula would be instantiated to a graph constraint similar 
to what appears on Fig. A. 16 and equation (A. 13). 




Fig. A. 17. Graph Constraint as Precondition and Postcondition 



This graph constraint is depicted as precondition and postcondition on Fig. A. 17. 
The equations are those adapted from (A. 13): 



h = $ A 2 qA 2 i 



A 20 v A 2 i 



fa = t A 2Q A 21 [a 2Q v A 2 il 



(A.14) 
(A.15) 



Only the diagram in which elements of type item3 appear has been kept because we 
know that in conveyor labelled 1 there should not be items of any other type (they would 
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never be processed). Actually, with the definitions of rules given up to now, conveyors 
connecting different machines are of the same kind. Hence, all six diagrams should appear 
on reject's left hand side and their transformation, according to theorem 7.4.6, on its 
right hand side. 

The precondition and the postcondition can be transformed into equivalent sequences 

according to theorems 7.3.5 and 7.4.2. This is a negative application condition, see 

theorem 7.2.4 and lemma 7.3.4. Hence, they arc split into two subconditions, each 

<— 

one demanding the nonexistence of one element. A 2Q will ask for the nonexistence of 
edge (2 : item3, 1 : conv) and A 20 for (3 : item3, 1 : conv). Similarly we have A' 21 for 
(2 : item3, 2 : conv) and A 21 for (3 : item3, 2 : conv). 7 At least one element in each case 
must not be present, so there are four combinations: 

reject i — > < reject; id <j ;id ^ , reject; id *- ; id ^ , 

^ ^21 ^20 ^21 ^20 

reject; id ^ ;id *y , reject; id ^ ;id <j > (A.16) 

^21 ^20 ^21 ^20 J 

The corresponding formula - the left arrow on top is omitted - can be written: 

3^20^20^210^21 [(^20 v ^20 ) (^21 v ^2l)] (A- 17) 

Here postconditions and preconditions turn out to be the same because re j ect lid y 
and re j ect _L id ^ . For each sequence it is posible to compose all productions and derive 

^2x 

a unique rule. If so, as there are just elements that have to be found in the complement 
of the host graph, they are appended to the nihilation matrix of the composition. 

For graph constraints, if something is to be forbidden, it is more common to think 
in "what should not be" , i.e. to think it as a postcondition (graph constraint GCo is of 
this type). On the contrary, if something is to be demanded then it is normally easier to 
describe it as a precondition. 



7 To be precise, there would be other two conditions asking for the nonexistence of 
(1 : item3, 1 : conv), however this part of the application condition is inconsistent for the 
first conveyor (this edge is demanded because it has to be erased) and redundant for the 
second conveyor (it would be fulfilled always because this edge is going to be added, so it can 
not exist in the left hand side) . This stems from the theory developed in Chap. 7. 
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Let's continue with another property of our system not addressed up to now. Note 
that conveyors clearly have a direction: Each one is the output of one or more machines 
and input of one or more machines. In our example this is simplified so conveyors just 
join two different machines. What might be of interest is that items in conveyors are 
naturally ordered. Machines should pick the first ordered element. 

To make our assembly line realize this feature, when the machine processes a new 
item - 2:item3 in Fig. A. 18 - and there is already an item in the output conveyor - 
l:item3 in Fig. A. 18 an edge from 2:item3 to l:item3 will be added. A chain is 
thus defined: The first element will have an incoming edge form another item, but it will 
not be the source of any edge that ends in other item. The last item will not have any 
incoming edge but one outgoing edge to another item. It has been exemplified for rule 
reject in Fig. A. 18. 8 





N 
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1 O 


I 

reject 


S3 


-30 

1 -131 



Fig. A. 18. Ordered Items in Conveyors 



Again we have to change the alloweable connections among types. The diagram in 
Fig. A. 10 needs to be further extended with a self- loop for items (there can be edges now) 
joining two of them. However, concrete items can not have self-loops, so a new graph 
constraint should take care of this. 

This ordering convention poses two problems when the rule is applied: 

1. If the input conveyor has two or more items, the first - the one with incoming edges 
- should be used. 

2. If the output conveyor has one or more items, the new item must be linked to the 
last one. 

8 We are not going to propose the modification of every single rule to handle ordering in 
conveyors. On the contrary, we are going to propose a method based on graph constraints 
and application conditions that automatically takes care of ordering. 
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The first if statement (pick the elder item) can be modelled by an application con- 
dition. We have a precondition A= (fx, t>i) with: 

fi = yA x 3A 2 p! a A 2 ]. (A. 18) 



Q □ 




Fig. A. 19. Expanded Rule reject 



The diagram is represented in Fig. A. 19. Numbered elements are related by the cor- 
responding morphisms. In formula fi the term MA\ . . . pi . . .] prevents the application 
of the rule if there is some marked item in the output conveyor (the blue square, read 
below). If the rule was applied then there would be two "last" items and it should become 
impossible to distinguish which one was added first. The term . . . 3A 2 [. . . A% \ forces the 
rule to pick the first item in the chain, just in case there was a chain. Item 1 : item3 will 
be chosen cither if it is the first in the chain or it is alone. This is equivalent to demand 
one item that has no outgoing edges to any other item. 

The second if statement can not be modelled with an application condition. The 
reason is that we need to add one edge in case a "last" item exists in the output conveyor 
(if the output conveyor is empty, then the rule should simply add the item). Applica- 
tion conditions arc limited to checking whether (almost any arbitrary combination of) 
elements are present or not, but they can not directly modify the actions of the rules. 
Anyway, the solution is not difficult: 

1. The newly added element needs to be marked so the last item in the conveyor can 
be identified: The blue square of A\ in Fig. A. 19 marks the last item added. 
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2. A precondition has to be imposed such that if there are marked items in the output 
conveyor, the rule can not be applied (this way at most one unlinked item will exist 
in each output conveyor). Again, see A\ in Fig. A. 19 and the corresponding term in 
(A.18). 

3. The grammar is enlarged with a new rule that checks if there are unlinked items 
(linking them, remMark2) and another that unmarks them if they are alone in the 
conveyor, remMarkl. See Fig. A. 20 
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Fig. A. 20. Rules to Remove Last Item Marks 



Both productions remMarkl and remMark2 have application conditions, AC\ = 
(fi , 0i = {Bi}) and AC 2 = (f 2 ,Q 2 = {B2}) respectively. The corresponding formulas are: 

fi = ^1 [B{\ 

f 2 = VS 2 \B 2 ] = jB 2 [B 2 ] 

Production remMarkl can be applied only if there is just a single item in the con- 
veyor. remMark2 applies when there is more than one item. B 2 selects the last item: It is 
equivalent to "the item with no incoming eges" . 

There is no problem in transforming both preconditions of Fig. A. 19 into postcon- 
ditions. Note that there are no dangling elements in A 2 because 1 : item3 is not erased 
(which would mean erasing and adding the same element, something forbidden in Matrix 
Graph Grammars, see comments right after Prop. 4.1.4). 

Notice that we have included ordering in conveyors with graph constraints and appli- 
cation conditions (there exists the possibility to transform one into the other), without 
really modifying existent grammar rules. Ordering is a property of the system and not of 
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the productions, which should just take care of the actions to be performed. We think that 
Matrix Graph Grammars clearly separate both topics: It is feasible to specify grammar 
rules first and properties of the system afterwards. With the theory developed in Chap. 
7 a framework - such as AToM 3 - can relate one to the other more or less automatically. 
Other examples of restrictions and limitations that can be imposed on the case study 

arc: 

• Limitations on the number of operators, e.g. a maximum of four operators. 

• An operator can be in charge of at most one machine. 

• There should not be two operators working in the same machine, which is a restriction 
on rules of type mov2*. 

More general constraints such as the number of operators can not exceed the number 
of machines are also possible, although variable nodes would be needed in this case. 

The examples so far are simple and can be expressed with other approaches to the 
topic. For other natural application conditions that can only be addressed with Matrix 
Graph Grammar approaches (to the best of our knowledge), please refer to the example 
on p. 180 or to [64]. The example studied in this appendix is a extended version of the 
one that appears there. 

A. 6 Derivations 

In this section a slight modification of the initial state depited in Fig. A. 6 together 
with a permutation of sequence so will be used again, but enlarged with ordering of 
productions (sequences) and restrictions of Sec. A. 5. Internal and external e-productions 
will be addressed in passing. 

Let's consider as initial state the one depicted in Fig. A. 21. Due to restrictions, 
sequence s 0 = pack; certify ; assem is not applicable (three items would appear in the 
input conveyor of pack). However, productions are all sequentially independent because 
they are applied to different items (due to the amount of elements available in the initial 
state in Fig. A. 21), so sequence s' 5 = certify ; pack ; assem can be considered instead. 

Sequence s' 5 can not be applied because the operator has to move to the appropriate 
machine and ordering of items needs to be considered. Let's suppose that the four basic 
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Fig. A. 21. Grammar Initial State for s' 5 



rules have a higher probability - or that they are in a higher layer, as e.g. in AGG 9 - 
so as soon as one of them is applicable it is in fact applied. According to the way an 
operator may move in our assembly line, applying s' 5 would need at least the following 
rules: 

s" = certify ; mov2Q ; mov2A ; recycle ; mov2D ; pack ; mov2P ; mov2Q ; assem. (A. 19) 

Production reject could have been applied somewhere in the sequence. Again, as 
items are ordered and some dangling edges appear during the process, this is not enough 
and some other productions need to be appended: 

S5 = (remMark2 ; certify ; certify,) ; mov2Q ; mov2A ; recycle ; mov2D ; 

(remMark2 ; pack ; pack £ ) ; mov2P ; mov2Q ; (remMark2 ; assem ; assem e ) 



C « ' 


assem f ^ G> ^ 


L 


pack £ ! 


< i 


c « 




certify | 3 0 



Fig. A. 22. Production to Remove Dangling Edges (Ordering of Items in Conveyors) 



Parentheses are used to isolate subsequences that could probably be composed to 
obtain more "natural" atomic actions. See Fig. A. 20 for the definition of remMark2 and 
9 AToM 3 has priorities. 
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Fig. A. 22 for assem e , pack £ and certify,,. In this case, both assem e and pack e are 
external while certify^ is internal. Productions between brackets are related through 
marking operator. It is mandatory that they act on the same nodes and edges. 

A user of a tool such as AToM 3 or AGG does not necessarily need to know about 
e-productions, even less about marking. Probably in this case it should be better to 
compose productions that include remMarkl or remMark2 and call them as the original 
rule, e.g. remMark2; assem i — > assem. The final state for S5 can be found in Fig. A. 23 



e 

G 



Fig. A. 23. Grammar Final State for S5 



A development framework should have facilities to case visualization of grammar 
rules, as diagrams can be quite cumbersome with only a few constraints. For example, 
it should be possible to keep graph constraints apart from productions, calculating on 
demand how a concrete constraint modifies a selected production, its left and right hand 
sides and nihilation matrix. 



References 



[1] Agrawal, A. 2004. A Formal Graph Transformation Based Language for Model-to- 
Model Transformations. Ph.D. Dissertation. Nashville, Tennessee. 

[2] Baldan, P., Corradini, A., Ehrig, H., L6we, M., Montanari, U. and Rossi, F., 1999. 
Concurrent Semantics of Algebraic Graph Transformations. In [24], pp.: 107-187. 

[3] Baudcron, M., Hclcne, J. 2001. Fullback as a Generic Graph Rewriting Mechanism. 
Applied Categorical Structures, 9(l):65-82. 

[4] Bauderon, M. 1995. Parallel Rewriting Through the Fullback Approach. Electronic 
Notes, 2. SEGRAGRA'95. 

[5] Bauderon, M. 1997. A Uniform Approach to Graph Rewriting: the Fullback Ap- 
proach. In Manfred Nagl, editor, Graph Theoretic Concepts in Computer Science, 
WG '96, Vol. 1017 of LNCS, pp. 101-115. Springer. 

[6] Brown, R., Morris, I., Shrimpton J., Wensley, CD. 2006. Graphs of Graphs and 
Morphisms. Preprint available at: http : //www . informatics . bangor . ac . uk/publi 
c/math/ research/ ftp/ cathom/ 06_04 . pdf 

[7] Biichi, J. 1960. Weak Second-Order Logic and Finite Automata. In Z Math. Logik 
Grundlagen Math. 5, 62-92. 

[8] Cormen, T., Leiserson, C, Rivest, R. 1990. Introduction to Algorithms. McGraw- 
Hill. 

[9] Corradini, A., Heindel, T., Hermann, F., Knig, B. 2006. Sesqui-pushout Rewriting. In 
Proc. of ICGT '06 (International Conference on Graph Transformation), pp. 30-45. 
Springer. LNCS 4178. 



270 References 



[10] Corradini, A., Montanari, U., Rossi, F. 1996. Graph Processes. Fundamenta Infor- 
maticac. Vol. 26. p. 241-265. 

[11] Corradini, A., Montanari, U., Rossi, F., Ehrig, H., Heckel, R., Lowe, M. 1999. Al- 
gebraic Approaches to Graph Transformation - Part I: Basic Concepts and Double 
Pushout Approach. In [23], pp.: 163-246 

[12] Courcellc, B. 1997. The expression of graph properties and graph transformations in 
monadic second-order logic. In [23], pp.: 313-400. 

[13] Drewes, F., Habel, A., Kreowski, H.-J., Taubenberger, S. 1995. Generating self-affine 
fractals by collage grammars. Theoretical Computer Science 145:159-187, 1995. 

[14] Ehrig, H., Ehrig, K., Habel, A., Pennemann, K.-H. 2006. Theory of Constraints 
and Application Conditions: From Graphs to High-Level Structures. Fundamenta 
Informaticae (74) pp.: 135-166, 2006 

[15] Ehrig, H., Ehrig, K., de Lara, J., Taentzer, T., Varro, D., Varro-Gyapay, S. 2005. 
Termination Criteria for Model Transformation. Proceedings of Fundamental Ap- 
proaches to Software Engineering FASE05 (ETAPS'05). Lecture Notes in Computer 
Science 3442 pp.: 49-63. Springer. 

[16] Ehrig, FL, Habel, A., Kreowski, H.-J., Parisi-Presicce, F. 1991. From Graph Gram- 
mars to High Level Replacement Systems. In H. Ehrig, H. J. Kreowski and G. Rozen- 
berg, editors, Graph Grammars and Their Application to Computer Science, vol. 532 
of LNCS, pp. 269-291. Springer. 

[17] Ehrig, H., Habel, A., Kreowski, H.-J., Parisi-Presicce, F. 1991. Parallelism and Con- 
currency in High-Level Replacement Systems. Mathematical Structures in Computer 
Science, l(3):361-404. 

[18] Ehrig, H., Habel, A., Padberg, J, Prange, U. 2004. Adhesive High-Level Replacement 
Categories and Systems. In H. Ehrig, G. Engels, F. Parisi-Presicce and G. Rozenberg, 
editors, Proceedings of ICGT 2004, Vol. 3256 of LNCS, pp. 144-160. Springer. 

[19] Ehrig, H. 1979. Introduction to the Algebraic Theory of Graph Grammars. In V. 
Claus, H. Ehrig, and G. Rozenberg (eds.), 1st Graph Grammar Workshop, pp. 1-69. 
Springer LNCS 73. 

[20] Ehrig, H., Nagl, M., Rozenberg, G., Rosenfeld, A., editors, 1987 Graph- Grammars 
and Their Application to Computer Science, 3rd International Workshop, Vol. 291 
of LNCS. Springer. 



References 271 



[21] Ehrig, H., Pfcndcr, M., and Schneider, H. J. 1973. Graph grammars: An Algebraic 
Approach. In Proc. IEEE Conf. on Automata and Switching Theory, SWAT '73, pp. 
167-180. 

[22] Ehrig, H., Ehrig, K., Prange, U., Taentzer, G. 2006. Fundamentals of Algebraic 

Graph Transformation. Springer. 
[23] Ehrig, H., Engcls, G., Krcowski, H.-J., Rozcnbcrg, G. 1999. Handbook of Graph 

Grammars and Computing by Graph Transformation. Vol 1. Foundations. World 

Scientific. 

[24] Ehrig, H., Kreowski, H.-J., Montanari, U., Rozenberg, G. 1999. Handbook of Graph 
Grammars and Computing by Graph Transformation. Vol.3., Concurrency, Paral- 
lelism and Distribution. World Scientific. 

[25] Eilcnberg, S. MacLane, S. 1945. General Theory of Natural Equivalence, Trans. 
Amcr. Soc. 231. 

[26] Elgot, C. 1961. Decision Problems of Finite Automata Design and Related Arith- 
metics. Trans. A.M.S. 98, 21-52. 
[27] Feder, J. 1971. Plex Languages. Information Sciences, 3:225-241. 
[28] Fokkinga, M. M. 1992. A Gentle Introduction to Category Theory — the Calcula- 

tional Approach. University of Utrecht. In Lecture Notes of the 1992 Summerschool 

on Constructive Algorithmics. pp.: 1-72. 
[29] Gulmann, J., Jensen, J., J0rgensen, M., Klarhmd, N., Rauhe, T., and Sandholm, A. 

1995. Mona: Monadic second-order logic in practice. In U.H. Engbcrg, K.G. Larsen, 

and A. Skou, editors, TACAS, pp. 58-73. Springer Verlag, LNCS. 
[30] Kreuzer, T. L. 2003. Term Rewriting Systems. Cambridge University Press. 
[31] Hcckel, R., Kiistcr, J. M., Taentzer, G. 2002. Confluence of Typed Attributed Graph 

Transformation Systems. In ICGT'2002. LNCS 2505, pp.: 161-176. Springer. 
[32] Hcckel, R., Wagner, A. 1995. Ensuring Consistency of Conditional Graph Grammars 

- A Constructive Approach -. Electronic Notes in Theoretical Computer Science 2. 
[33] Hcinbockcl, J.H. 1996. Introduction to Tensor Calculus and Continuum Mechanics. 

Old Dominion University. Free version (80% of Material) Avail, at http://www. 

math . odu . edu/~jhh/ counter2 . html. 
[34] Hoffman, B. 2005. Graph Transformation with Variables. In Graph Transformation, 

Vol. 3393/2005 of LNCS, pp. 101-115. Springer. 



272 References 



[35] Lammel, R., Mernik, M., eds., 2001. Domain- Specific Languages. Special Issue of 
the Journal of Computing and Information Technology ( GIT). 

[36] Kahl, W., 2002. A Relation- Algebraic Approach to Graph Structure Transformation. 
PhD Thesis. 

[37] Kauffman, L.H. Knots. Avail, at http://www.math.uic.edU/~kauffman/Tots/K 
nots . htm 

[38] Kawahara, Y. 1973. Relations in Categories with Pullbacks. Mem. Fac. Sci. Kyushu 
Univ. Ser. A, 27(1): 149-173. 

[39] Kawahara, Y. 1973. Matrix Calculus in I-categories and an Axiomatic Character- 
ization of Relations in a Regular Category. Mem. Fac. Sci. Kyushu Univ. Ser. A, 
27(2): 249-273. 

[40] Kawahara, Y. 1973. Notes on the Universality of Relational Functors. Mem. Fac. 
Sci. Kyushu Univ. Ser. A, 27(2): 275-289. 

[41] Kennaway, R., 1987. On Graph Rewritings. Theoretical Computer Science, 52:37-58. 

[42] Kennaway, R. 1991. Graph Rewriting in Some Categories of Partial Morphisms. In 
Ehrig ct al. [20], pp. 490-504. 

[43] Lack, S., Sobocihski, P. 2004. Adhesive Categories. In I. Walukievicz, editor, Pro- 
ceedings of FOSSACS 2001 Vol. 2987 of LNCS, pp. 273-288. Springer. 

[44] de Lara, J., Hans Vangheluwe, H. 2002. AToM 3 : A Tool for Multi-Formalism Mod- 
elling and M eta- Modelling. LNCS 2306, pp.:174-188. Fundamental Approaches to 
Software Engineering - FASE'02, in European Joint Conferences on Theory And 
Practice of Software - ETAPS'02 . Grenoble. France. 

[45] de Lara, J., Vangheluwe, H., 2004. Defining Visual Notations and Their Manipu- 
lation Through M eta- Modelling and Graph Transformation. Journal of Visual Lan- 
guages and Computing. Special Issue on "Domain-Specific Modeling with Visual 
Languages", Vol 15(3-4), pp.: 309-330. Elsevier Science 

[46] de Lara, J., Bardohl, R., Ehrig, H., Ehrig, K., Prange, U, Taentzer, G. 2007. At- 
tributed Graph Transformation with Node Type Inheritance. Theoretical Computer 
Science (Elsevier), 376(3): 139-163. 

[47] Mcndelson, E. 1997. Introduction to Mathematical Logic, Fourth Edition. Chapman 
& Hall. 



References 273 



[48] Lowe, M., 1990. Algebraic Approach to Graph Transformation Based on Single 

Pushout Derivations. Technical Report 90/05, TU Berlin. 
[49] Mac Lane, S. 1998. Categories for the Working Mathematician. Springer. ISBN 0- 

387-98403-8. 

[50] Minas, M. 2002. Concepts and Realization of a Diagram Editor Generator Based on 
Hypergraph Transformation. Science of Computer Programming, Vol. 44(2), pp: 157 
- 180. 

[51] Mizoguchi, Y., Kawahara, Y. 1995. Relational Graph Rewritings. Theoretical Com- 
puter Science, Vol 141, pp. 311-328. 

[52] Manzano, M. 1996. Extensions of First- Order Logics (Cambridge Tracts in Theoret- 
ical Computer Science). Cambridge University Press. 

[53] Murata, T. 1989. Petri nets: Properties, Analysis and Applications. Proceedings of 
the IEEE, Vol 77(4), pp. 541-580. 

[54] Nagl, M. 1976. Formal Languages of Labelled Graphs. Computing 16, 113-137. 

[55] Nagl, M. 1979. Graph- Grammatiken. Vieweg, Braunschweig. 

[56] Newman, J. 1956. the World of Mathematics. Simon & Schuster, New York. 

[57] Papadimitriou, C. 1993. Computational Complexity. Addison Wesley. 

[58] Pavlidis, T. 1972. Linear and Context-Free Graph Grammars. Journal of the ACM, 
19(l):ll-23. 

[59] Perez Velasco, P. P., de Lara, J. 2006. Towards a New Algebraic Approach to Graph 

Transformation: Long Version. Technical Report of the School of Computer Science, 

Universidad Autonoma de Madrid. Available at http://www.ii.uam.es/~jlara/ 

invest igacion/techrep_03_06 . pdf . 
[60] Perez Velasco, P. P., de Lara, J. 2006. Matrix Approach to Graph Transformation. 

Mathematical Aspects of Computer Science. Proc. ICM'06, Vol. Abstracts, p. 128. 

European Mathematical Society. 
[61] Perez Velasco, P. P., de Lara, J. 2006. Matrix Approach to Graph Transformation: 

Matching and Sequences. Proc. ICGT'06, LNCS 4218, pp.:122-137. Springer. 
[62] Perez Velasco, P. P., de Lara, J. 2006. Petri Nets and Matrix Graph Grammars: 

Reachability. Proc. PN-GT'06, Electronic Communications of EASST(2). 



274 References 

[63] Perez Velasco, P. P., de Lara, J. 2007. Using Graph Grammars for the Analysis of Be- 
havioural Specifications: Sequential and Parallel Independence. Proc. PROLE'2007. 
Also as ENTCS (Elsevier). 

[64] Perez Velasco, P. P., de Lara, J. 2007. Analysing Rules with Application Conditions 
Using Matrix Graph Grammars. Proc. GT-VC'2007. 

[65] Penrose, R. 2006. The Road to Reality: a Complete Guide to the Laws of the Universe. 
Knof, 0679454438. 

[66] Pfaltz, J.L., Rosenfcld, A. 1969. Web Grammars. Proc. Int. Jont Conf. Art. Intelli- 
gence, Washington, 1969, pp. 609-619. 

[67] Raoult, J. C, 1984. On Graph Rewritings. Theoretical Computer Science, 32:1-24. 

[68] Reisig, W., 1985. Petri Nets, an Introduction. Springer- Verlag, Berlin. 

[69] Schneider, H. J. 1970. Chomsky- System fur Partielle Ordnungen, Arbeitsber. d. Inst, 
f. Math. Masch. u. Datenver. 3, Erlangen. 

[70] Schiirr, A. 1994. Specification of Graph Translators with Triple Graph Grammars. 
Proc. 20th International Workshop on Graph-Theoretic Concepts in Computer Sci- 
ence. LNCS 903, pp.: 151 - 163. Springer. 

[71] Smullyan, R. 1995. First-Order Logic. Dover Publications. 

[72] Sokolnikoff, I.S. 1951. Tensor Analysis, Theory and Applications. John Wiley and 
Sons. 

[73] Taentzer, G. 2004. AGG: A Graph Transformation Environment for Modeling and 
Validation of Software. AGTIVE 2003, LNCS 3062, pp.: 446-453. Springer. 

[74] Terese. 2003. Term Rewriting Systems. Cambridge University Press. 

[75] Thomas, W. 1990. Automata on Infinite Objects. In J. van Leeuwen, editor, Handbook 
of Theoretical Computer Science, Vol. B, pp. 133-198. MIT Press/Elsevier. 

[76] Vollmer, H. 1999. Introduction to Circuit Complexity: A Uniform Approach. Text in 
Theoretical Computer Science. EATCS Series. 



Index 



abelian group 37 
adjacency matrix 27 
adjoint operator 36 
allegory 64 

distributive 65 
amalgamation 46 
analysis of a derivation 46 
applicability 7 
application condition 47 

coherent 172 

compatible 172 

consistent 172 

in MGG 168 

weak 168 
arity 16 

Banach space 36 
binary relation 60 
Boolean matrix product 29 
boundedness 227 

categorical product 21 
category 19 



Graph 20 
Graph p 20 
PTNets 25 
Poset 24 
Rel 62 
Set 19 
Set p 63 
Top 24 

adhesive HLR 23 
Dedekind 65 
weak adhesive HLR 
class 19 

closed formula 16 

closure 175 

cocone 22 

coherence 80, 215 

colimit 22 

compatibility 215 
graph 30 
production 72 
sequence 104 

completion 76 



276 Index 



complexity 259 
composition 106 
concatenation 79 
cone 22 

conflict-free condition 49 

confluence 9 

congruence condition 138 

negative 138 

positive 138 
context graph 43 
contraction 33 
contravariance 33 
coproduct 22 
covariance 33 
cycle 38 

dangling 

condition 30, 43 

edge 3, 30 
daughter graph 53 
decomposition 176 
definition scheme 61 
derivation 8 

exact 129 
diagram 162, 166 
direct derivation 8 

DPO 43 

MGG 113 

SPO 49 
direct transformation 48 
distance 36 
domain 63 

domain of discourse 17 
double pullback (DPB) 51 
double pushout (DPO) 42 



DSL, Domain-Specific Languages 237 
dual space 35 

e-production 

adjoint operator 119 
edge 

addition 69 

deletion 69 

external 128 

internal 128 

type 76 

fixed grammar 120 
floating grammar 120 
FOL 

connective 15 

constant 15 

first order logic 15 

function 15 

quantifier 15 

symbol 15 

variable 15 
function 

partial 62 

total 62 
functional representation 

closure 186, 191 

decomposition 185, 191 

match 182, 191 

negative application condition 191 
negative application condition 187 
production 117 
functor 20 

G-congruence 134 
gluing condition 44 



Index 277 



graph constraint 166 

fulfillment 170 
graph pattern 199 
ground formula 16, 166 
group 37 

Hilbert space 34 
hyperedge 56 
hypergraph 57 
isomorphism 57 

identification condition 43 
identity conjugate 182 
incidence matrix 27, 216 
incidence tensor 220 

matrices 216 
independence 8 
initial digraph 

actual 128 

set 123 
initial object 19 
inner product 33, 34 
interface 42 

interpretation function 17 
invariants 

place 227 

transition 227 

kernel (graph) 199 
Kronecker delta 33 
Kronecker product 32 

Levi-Civita symbol 33 
LHS, Left Hand Side 69 
limit 22 
line graph 27 



liveness 227 

marking 210 

minimal 214 

operator 121 
match 

DPO 43 

extended 116 

MGG 112 

SPO 49 
metric 36 
metric tensor 33 
MGG, Matrix Graph Grammar 6 
minimal initial digraph 91 
monadic second order logic, MSOL 18 
morphism 

partial 63 
mother graph 53 
multidigraph constraints 203 
multigraph 20 
multinode 200 

NCE 54 
negative 

application condition 47 

graph constraint 47 

initial digraph 101 

initial set 125 
nihilation matrix 98 
NLC 53 
node 

addition 69 

deletion 69 

type 74 

vector 28 



278 Index 



norm 35 

of Boolean vector 30 

operator 34 

delta 85 

nabla 85 
order 31 
outer product 32 

e-production 118 

external 128 

internal 128 
parallel 

independence 44 

production 45 
Parikh vector 211 
parity 38 
permutation 38 
Petri net 210 

conservative 227 

definition 210 

pure 214 
place 210 
positive 

application condition 47 
atomic 47 

graph constraint 47 
atomic 47 
postcondition 47 

MGG 168 

weak 168 
precondition 47 

MGG 168 

weak 168 
production 



e 118 
DPO 42 

dynamic formulation 99 
SPO 49 

static formulation 68 
propositional logic 15 
pullback 22 
pullout 65 
pushout 22 

complement 23 

initial 23 

7?.-structure 60 
rank 31 

reachability 8, 210, 215 
relation 62 

equivalence 76 

universal 64 

zero 64 
RHS, Right Hand Side 71 
Riesz representation theorem 35 
rule scheme 199 

scalar product 34 

second order logic, SOL 17 

sequence 79 

sequential confluence 10 

sequential independence 8, 45 

generalization 148, 153 

weak 50 
signature 38 
simple 

digraph 27 

node 200 
single 



Index 279 



pullback (SPB) 51 
pushout (SPO) 49 
source 20 

state equation 211, 225 
string 57 

length 57 
subgroup 38 
substitution function 199 
synthesis of a derivation 46 

target 20 
tensor 31 
product 32 
for graphs 29 
terminal object 19 
termination 259 
token 210 
transduction 60 



transformation (HLR systems) 48 
transition 210 

enabled 210 

firing 210 
transposition 38 

even 38 

odd 38 
true concurrency 156 
type 76 

universal property 20 
valence 31 

Van Kampen square 24 

weak parallell independence 45 
well-definedness 166 

■ET-production 204 



